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Knowing and guessing, these are two essential epistemological pillars in the theory of quantum- 
mechanical measurement. As formulated quantum mechanics is a statistical theory. In general, a 
priori unknown states can be completely determined only when measurements on infinite ensembles 
of identically prepared quantum systems are performed. But how one can estimate (guess) quantum 
state when just incomplete data are available (known)? What is the most reliable estimation based 
on a given measured data? What is the optimal measurement providing only a finite number of 
identically prepared quantum objects are available? These are some of the questions we address in 
the article. 

We present several schemes for a reconstruction of states of quantum systems from measured 
data: 

(1) We show how the maximum entropy (MaxEnt) principle can be efficiently used for an estimation 
of quantum states (i.e. density operators or Wigner functions) on incomplete observation levels, 
when just a fraction of system observables are measured (i.e., the mean values of these observables 
are known from the measurement). With the extention of observation levels more reliable estimation 
of quantum states can be performed. In the limit, when all system observables (i.e., the quorum 
of observables) are measured, the MaxEnt principle leads to a complete reconstruction of quantum 
states, i.e. quantum states are uniquely determined. We analyze the reconstruction via the MaxEnt 
principle of bosonic systems (e.g. single-mode electromagnetic fields modeled as harmonic oscillators) 
as well as spin systems. We present results of MaxEnt reconstruction of Wigner functions of various 
nonclassical states of light in different observation levels. We also present results of numerical 
simulations which illustrate how the MaxEnt principle can be efficiently applied for a reconstruction 
of quantum states from incomplete tomographic data. 

(2) When only a. finite number of identically prepared systems are measured, then the measured data 
contain only information about frequencies of appearances of eigenstates of certain observables. We 
show that in this case states of quantum systems can be estimated with the help of quantum Bayesian 
inference. We analyze the connection between this reconstruction scheme and the reconstruction via 
the MaxEnt principle in the limit of infinite number of measurements. We discuss how an a priori 
knowledge about the state which is going to be reconstructed can be utilized in the estimation 
procedure. In particular, we discuss in detail the difference between the reconstruction of states 
which are a priori known to be pure or impure. 

(3) We show how to construct the optimal generalized measurement of a finite number of identically 
prepared quantum systems which results in the estimation of a quantum state with the highest 
fidelity. We show how this optimal measurement can in principle be realized. We analyze two 
physically interesting examples - a reconstruction of states of a spin- 1/2 and an estimation of phase 
shifts. 
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I. INTRODUCTION: MEASUREMENT OF QUANTUM STATES 



The concept of a quantum state represents one of the most fundamental pillars of the paradigm of quantum theory 
. Contrary to its mathematical elegance and convenience in calculations, the physical interpretation of a quantum 
state is not so transparent. The problem is that the quantum state (described either by a state vector, or density 
operator or a phase-space probability density distribution) does not have a well defined objective status, i.e. a 
state vector is not an objective property of a particle. According to Peres (see p. 374): "There is no physical 
evidence whatsoever that every physical system has at every instant a well defined state... In strict interpretation 
of quantum theory these mathematical symbols [i.e., state vectors] represent statistical information enabling us to 
compute the probabilities of occurrence of specific events." Once this point of view is adopted then it becomes clear 
that any "measurement" or a reconstruction of a density operator (or its mathematical equivalent) can be understood 
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exclusively as an expression of our knowledge about the quantum mechanical state based on a certain set of measured 
data. To be more specific, any quantum-mechanical reconstruction scheme is nothing more than an a posteriori 
estimation of the density operator of a quantum-mechanical (microscopic) system based on data obtained with the 
help of a macroscopic measurement apparatus ||. The quality of the reconstruction depends on the "quality" of 
the measured data and the efficiency of the reconstruction procedure with the help of which the data analysis is 
performed. In particular, we can specify three different situations. Firstly, when all system observables are precisely 
measured. In this case the complete reconstruction of an initially unknown state can be performed (we will call this 
the reconstruction on the complete observation level). Secondly, when just part of the system observables is precisely 
measured then one cannot perform a complete reconstruction of the measured state. Nevertheless, the reconstructed 
density operator still uniquely determines mean values of the measured observables (we will denote this scheme as 
reconstruction on incomplete observation levels). Finally, when measurement does not provide us with sufficient 
information to specify the exact mean values (or probability distributions) but only the frequencies of appearances of 
eigenstates of the measured observables, then one can perform an estimation (e.g. reconstruction based on quantum 
Bayesian inference) which is the "best" with respect to the given measured data and the a priori knowledge about 
the state of the measured system. 



A. Complete observation level 



Providing all system observables (i.e., the quorum (Q,g|) have been precisely measured, then the density operator of 
a quantum- mechanical system can be completely reconstructed (i.e., the density operator can be uniquely determined 
based on the available data). In principle, we can consider two different schemes for reconstruction of the density 
operator (or, equivalently, the Wigner function) of the given quantum-mechanical system. The difference between 
these two schemes is based on the way in which information about the quantum-mechanical system is obtained. The 
first type of measurement is such that on each element of the ensemble of the measured states only a single observable 
is measured. In the second type of measurement a simultaneous measurement of conjugate observables is assumed. 
We note that in both cases we will assume ideal, i.e., unit-efficiency, measurements. 



1. Quantum tomography 

When the single-observable measurement is performed, a distribution w\y\(A) for a particular observable A of the 
state \ip) is obtained in an unbiased way [[|, i.e., w^(A) = | (^>a\^) | 2 , where \<&a) are eigenstates of the observable 
A such that J2 A \&a) {®a\ = 1- Here a question arises: What is the smallest number of distributions w\^(A) 
required to determine the state uniquely? If we consider the reconstruction of the state of a harmonic oscillator, then 
this question is directly related to the so-called Pauli problem j7| of the reconstruction of the wave-function from 
distributions w\^(q) and w\^(p) for the position and momentum of the state |^/). As shown by Gale, Guth and 
Trammel Q the knowledge of w^(q) and w^(j>) is not in general sufficient for a complete reconstruction of the 
wave (or, equivalently, the Wigner) function. In contrast, one can consider an infinite set of distributions (xg, 9) 
of the rotated quadrature xg = qcosO + psmO. Each distribution w\^(xg,9) can be obtained from a measurement 
of a single observable xg, in which case a detector (filter) is prepared in an eigenstate \xg) of this observable. It has 
been shown by Vogel and Risken || that from an infinite set (in the case of the harmonic oscillator) of the measured 
distributions w^) (xg, 9) for all values of 9 such that [0 < 6 < 7r], the Wigner function can be reconstructed uniquely 
via the inverse Radon transformation. In other words knowledge of the set of distributions (xg, 9) is equivalent to 
knowledge of the Wigner function. This scheme for reconstruction of the Wigner function (i.e., the optical homodyne 
tomography) has recently been realized experimentally by Raymer and his coworkers [ pL0|Jl 1| . In these experiments 
the Wigner functions of a coherent state and a squeezed vacuum state have been reconstructed from tomographic 
data. Very comprehensive discussion of the quantum homodyne tomography can be found in the book by Leonhardt 
p2| and the review article by Welsch, Vogel and Opatrny pq ]. Quantum homodyne tomography can be efficiently 
performed not only with the help of the inverse Radon transformation but also with the help of the so-called pattern 
functions 0,^5| • Other theoretical concept of Wigner- function reconstruction has been considered by Royer |T(| . 

Quantum-state tomography can be applied not only to optical fields but also for reconstruction of other physical 
systems. In particular, recently Janicke and Wilkens jlTj have suggested that Wigner functions of atomic waves can be 
tomographically reconstructed. Kurtsiefer et al. Jl8[ ] have performed experiments in which Wigner functions of matter 
wave packets have been reconstructed. Yet another example of the tomographic reconstruction is a reconstruction of 
Wigner functions of vibrational states of trapped atomic ions theoretically described by a number of groups and 
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experimentally measured by Leibfried et al. [M ■ Vibrational motional states of molecules have also been reconstructed 
by this kind of quantum tomography by Dunn et al. |2l|| . 

Leonhardt |^2| has recently developed a theory of quantum tomography of discrete Wigner functions describing 
states of quantum systems with finite-dimensional Hilbert spaces (for instance, angular momentum or spin). We 
note that the problem of reconstruction of states of finite-dimensional systems is closely related to various aspects of 
quantum information processing, such as reading of registers of quantum computers |23|] . This problem also emerges 
when states of atoms are reconstructed (see, for instance, ]24[|). 

Here we stress once again, that reconstruction on the complete observation level (such as quantum tomography) 
is a deterministic inversion procedure which helps us to "rewrite" measured data in the more convenient form of a 
density operator or a Wigner function of the measured state. 

2. Filtering with quantum rulers 

For the case of simultaneous measurement of two non-commuting observables (let us say q and p) , it is not possible 
to construct a joint cigcnstate of these two operators, and therefore it is inevitable that the simultaneous measurement 
of two non-commuting observables introduces additional noise (of quantum origin) into measured data. This noise is 
associated with Heisenberg's uncertainty relation and it results in a specific "smoothing" (equivalent to a reduction 
of resolution) of the original Wigner function of the system under consideration (see |25|] and [^6| and the reviews 
p7| , p8|] ). To describe the process of simultaneous measurement of two non-commuting observables, Wodkiewicz p9| ] 
has proposed a formalism based on an operational probability density distribution which explicitly takes into account 
the action of the measurement device modeled as a "filter" (quantum ruler). A particular choice of the state of the 
ruler samples a specific type of accessible information concerning the system, i.e., information about the system is 
biased by the filtering process^] The quantum-mechanical noise induced by filtering formally results in a smoothing of 
the original Wigner function of the measured state |25|,^6| , so that the operational probability density distribution can 
be expressed as a convolution of the original Wigner function and the Wigner function of the filter state. In particular, 
if the filter is considered to be in its vacuum state then the corresponding operational probability density distributions 
is equal to the Husimi (Q) function J25|. The Q function of optical fields has been experimentally measured using 
such an approach by Walker and Carroll |s3]. The direct experimental measurement of the operational probability 
density distribution with the filter in an arbitrary state is feasible in an 8-port experimental setup of the type used 
by Noh, Fougeres and Mandel @ (see also @|||). 

As a consequence of a simultaneous measurement of non-commuting observables the measured distributions are 
fuzzy (i.e., they are equal to smoothed Wigner functions). Nevertheless, if detectors used in the experiment have 
unit efficiency (in the case of an ideal measurement), the noise induced by quantum filtering can be "separated" 
from the measured data and the density operator (Wigner function) of the measured system can be "extracted" from 
the operational probability density distribution. In particular, the Wigner function can be uniquely reconstructed 
from the Q function (for more details see pq]). This extraction procedure is technically quite involved and it suffers 
significantly if additional stochastic noise due to imperfect measurement is present in the data. 

We note that propensities, and in particular Q-functions, can also be associated with discrete phase space and 
they can in principle be measured directly p6[ . These discrete probability distributions contain complete information 
about density operators of measured systems. Consequently, these density operators can be uniquely determined from 
the discrete-phase space propensities. 

B. Reduced observation levels and MaxEnt principle 

As we have already indicated it is well understood that density operators (or Wigner functions) can, in principle, 
be uniquely reconstructed using either the single observable measurements (optical homodyne tomography) or the 
simultaneous measurement of two non-commuting observables. The completely reconstructed density operator (or, 
equivalently, the Wigner function) contains information about all independent moments of the system operators. For 



x The quantum filtering, i.e. the measurement with "unsharn observables" belongs to a class of generalized POVM (positive 
operator value measure) measurements 1^,0 . In Section [x] we will show that POVM measurements are in some cases the 
most optimal one when the state estimation is based on measurements performed on finite ensembles. 
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example, in the case of the quantum harmonic oscillator, the knowledge of the Wigner function is equivalent to the 
knowledge of all moments ((a') m d n ) of the creation (a') and annihilation (a) operators. 

In many cases it turns out that the state of a harmonic oscillator is characterized by an infinite number of inde- 
pendent moments {(a)) m a n ) (for all m and n). Analogously, the state of a quantum system in a finite-dimensional 
Hilbert space can be characterized by a very large number of independent parameters. A complete measurement 
of these moments may take an infinite time to perform. This means that even though the Wigner function can in 
principle be reconstructed the collection of a complete set of experimental data points is (in principle) a never ending 
process. In addition the data processing and numerical reconstruction of the Wigner function are time consuming. 
Therefore experimental realization of the reconstruction of the density operators (or Wigner functions) for many 
systems can be difficult. 

In practice, it is possible to measure just a finite number of independent moments of the system operators, so that 
only a subset G v (y = 1,2, ...,n) of observables from the quorum (this subset constitutes the so-called observation 
level |37|]) is measured. In this case, when the complete information about the system is not available, one needs 
an additional criterion which would help to reconstruct (or estimate) the density operator uniquely. Provided mean 
values of all observables on the given observation level are measured precisely, then the density operator (or the 
Wigner function) of the system under consideration can be reconstructed with the help of the Jaynes principle of 
maximum entropy (the so called MaxEnt principle) J37J (see also J38] jjofl). The MaxEnt principle provides us with a 
very efficient prescription to reconstruct density operators of quantum-mechanical systems providing the mean values 
of a given set of observables are known. It works perfectly well for systems with infinite Hilbert spaces (such as 
the quantum-mechanical harmonic oscillator) as well as for systems with finite-dimensional Hilbert spaces (such as 
spin systems). If the observation level is composed of the quorum of the observables (i.e., the complete observation 
level), then the MaxEnt principle represents an alternative to quantum tomography, i.e. both schemes are equally 
suitable for the analysis of the tomographic data (for details see jfl]]). To be specific, the observation level in this 
case is composed of all projectors associated with probability distributions of rotated quadratures. The power of 
the MaxEnt principle can be appreciated in analysis of incomplete tomographic data. In particular cases MaxEnt 
reconstruction from incomplete tomographic data can be several orders better than a standard tomographic inversion 
(see Section VI). This result suggests that the MaxEnt principle is the conceptual basis underlying incomplete 
tomographic reconstruction (irrespective whether this is employed in continuous or discrete phase spaces). 



C. Incomplete measurement and Bayesian inference 



It has to be stressed that the Jaynes principle of maximum entropy can be consistently applied only when exact 
mean values of the measured observables are available. This condition implicitly assumes that an infinite number of 
repeated measurements on different elements of the ensemble has to be performed to reveal the exact mean value of 
the given observable. In practice only a finite number of measurements can be performed. What is obtained from 
these measurements is a specific set of data indicating the number of times the eigenvalues of given observables have 
appeared (which in the limit of an infinite number of measurements results in the corresponding quantum probability 
distributions). The question is how to obtain the best a posteriori estimation of the density operator based on the 



measured data. Helstrom 30 1, Holevo |3l|, and Jones |42| have shown that the answer to this question can be 



given by the Bayesian inference method, providing it is a priori known that the quantum-mechanical state which 
is to be reconstructed is prepared in a pure (although unknown) state. When the purity condition is fulfilled, then 
the observer can systematically estimate an a posteriori a probability distribution in an abstract state space of the 
measured system. It is this probability distribution (conditioned by the assumed Bayesian prior) which characterizes 
observer's knowledge of the system after the measurement is performed. Using this probability distribution one can 
derive a reconstructed density operator, which however is subject to certain ambiguity associated with the choice of 
the cost function (see ref. pC[ , p. 25). In general, depending on the choice of the cost function one obtains different 
estimators (i.e., different reconstructed density operators). In this paper we adopt the approach advocated by Jones 
p2[ when the estimated density operator is equal to the mean over all possible pure states weighted by the estimated 
probability distribution (see below in Section IX ) . We note once again that the quantum Bayesian inference has been 
developed for a reconstruction of pure quantum mechanical states and in this sense it corresponds to an averaging 
over a generalized microcanonical ensemble. Nevertheless it can also be applied for a reconstruction of impure states 
of quantum systems |43| . The Bayesian inference based on appropriate a priori assumptions in the limit of infinite 
number of measurements results in the same estimation as the reconstruction via the MaxEnt principle. 
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D. The optimal generalized measurements 



The quantum Bayesian inference allows us to estimate reliably the quantum state from a given set of measured 
data obtained in a specific measurement performed on a finite ensemble of identically prepared quantum objects. But 
the measurement itself may be designed very badly, i.e. the chosen observables do not efficiently reveal the nature 
of the state. Therefore the question is: Given the finite ensemble of N identical quantum objects prepared in an 
unknown quantum state. What is the the optimal measurement which should provide us the best possible estimation 
of this unknown state? Holevo |H| (see also Q]) has solved this problem. He has shown that the so called covariant 
generalized measurements are the optimal one. The problem is that these generalized measurements are associated 
with an infinite (continuous) number of observables. This obviously is physically unrealizable measurement. On the 
other hand it has been recently shown p5|| how to find a finite optimal generalized measurements. This allows to 
design optimal measurements such that the data obtained in these measurements allow for best estimation of quantum 
states. 

The purpose of the present paper is to show how the various estimation procedures can be applied in different 
situations. In particular, we show how the MaxEnt principle can be applied for a reconstruction of quantum states 
of light fields and spin systems. We show how the quantum Bayesian inference can be used for a reconstruction 
of spin systems and how it is related to the reconstruction via the MaxEnt principle. We also present a universal 
algorithm which allows us to "construct" the optimal generalized measurements. The paper is organized as follows: 
In Section |l| we briefly describe main ideas of the MaxEnt principle. In Section III we set up a scene for a description 
of reconstruction of quantum states of light fields. In this section we briefly discuss the phase-space formalism which 
can be used for a description of quantum states of light. In addition we introduce the states of light which are going to 
be considered later in the paper. In Section IV we introduced of various observation levels suitable for a description of 
light fields. Reconstruction of Wigner functions of light fields on these observation levels is then discussed in Section [v|. 
In Section VI we present results of numerical reconstruction of quantum states of light from incomplete tomographic 
data. We compare two reconstruction schemes: reconstruction via the MaxEnt principle and the reconstruction via 
direct sampling (i.e., the tomography reconstruction via pattern functions - see below in Section III). We analyze the 
reconstruc tion o f spin systems via the MaxEnt principle in Section VII . The Bayesian quantum inference is discussed 
in Section VIII and its application to spin systems is presented in Section [X . Finally, in Section [x] we discuss how 
the optimal realizable (i.e. finite) measurements can be designed. 



II. MAXENT PRINCIPLE AND OBSERVATION LEVELS 



The state of a quantum system can always be described by a statistical density operator p. Depending on the 
system preparation, the density operator represents either a pure quantum state (complete system preparation) or a 
statistical mixture of pure states (incomplete preparation). The degree of deviation of a statistical mixture from the 
pure state can be best described by the uncertainty measure n[p] (see P, |3^ JIo| , [46f ) 

V [p]=-Tr(plnp). (2.1) 



The uncertainty measure rj[p\ possesses the following properties: 
1. In the eigenrepresentation of the density operator p 



p\r m ) = r m \r m ), (2.2) 

we can write 

v[p\ = r m lnr m > 0, (2.3) 

m 

where r m are eigenvalues and \r m ) the eigenstates of p. 

2. For uncertainty measure rj[p\ the following inequality holds: 

< r)[p] < In TV, (2.4) 

where N denotes the dimension of the state space of the system and rj[p\ takes its maximum value when 



1 1 

TrT ~~ iV 5 



(2.5) 
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In this case all pure states in the mixture appear with the same probability equal to 1/N. If the system is prepared 

in a pure state then it holds that n[p\ = 0. 

3. It can be shown with the help of the Liouville equation 

lp(t)=-~[H,p(t)], (2.6) 

that in the case of an isolated system the uncertainty measure is a constant of motion, i.e., 

dv(t) 



dt 



= 0. (2.7) 



A. MaxEnt principle 

When instead of the density operator p, expectation values G v of a set O of operators G u (v = 1, . . . , n) are given, 
then the uncertainty measure can be determined as well. The set of linearly independent operators is referred to 
as the observation level O p8| , p[ |. The operators G v which belong to a given observation level do not commutate 
necessarily. A large number of density operators which fulfill the conditions 

Tr/3 {6} = 1, (2.8) 

r ^{p{G} G u) = Gv, v = l,2,...,n; 



can be found for a given set of expectation values G v = (G„), that is the conditions (2.8) specify a set C of density 
operators which has to be considered. Each of these density operators P{q} can posses a different value of the 
uncertainty measure ^[/Sj-gj.]. If we wish to use only the expectation values Gu of the chosen observation level for 
determining the density operator, we must select a particular density operator p^y — ®{g} m an unbiased manner. 
According to the Jaynes principle of the Maximum Entropy [|37j-fl0[ this density operator &^ G y must be the one which 
has the largest uncertainty measure 



(2.9) 

and simultaneously fulfills constraints (j2Ji|). As a consequence of Eq.( |2."9| ) the following fundamental inequality holds 

^{G}] = _Tr (^{G} ln& {6}) ^ T ^{G}] = _Tr (^{G} ln P{G}) ( 2 - 10 ) 



for all possible P0j which fulfill Eqs.(2.8). The variation determining the maximum of ^[ovg-j.] under the conditions 



(2.8) leads to a generalized canonical density operator f37| , j38 40,41] 



a {6} = -^—e X p(-^2x v d v y, (2.11) 



Z {G} 

Z {G} (Xu X n ) = Tr[exp(- ^ X U G V )], (2.12) 

V 

where X n are the Lagrange multipliers and ^r^-, (Ai, . . . A n ) is the generalized partition function. By using the 
derivatives of the partition function we obtain the expectation values G v as 

G v = Tr(a {d} G„) = - A lnZ {6} (Ai, A„), (2.13) 

where in the case of noncommuting operators the following relation has to be used 

l 

^- exp[-A(a)] = exp[-A(a)] f exp\pX(a)] exp[-p,X(a)] dp. (2.14) 



G 



By using Eq.( 2.13j ) the Lagrange multipliers can, in principle, be expressed as functions of the expectation values 

K = A„(Gi, G n ). (2.15) 



We note t hat Eqs.(2.13) for Lagrange multipliers not always have solutions which lead to physical results (see Sec 
tion 
level 



VI B ) , which means that in these cases states of quantum systems cannot be reconstructed on a given observation 
The maximum uncertainty measure regarding an observation level C^gj. will be referred to as the entropy 

S {g} = fmax = -Tr(<r {(5} lna {&} ). (2.16) 



This means that to different observation levels different entropies are related. By inserting <J{ G } [cf. Eq.(2Tl)] into 
Eq.(f2~l6l), we obtain the following expression for the entropy 



^{G} _ ln Z {G} 



(2.17) 



By making use of Eq.( 2.15| ), the parameters A„ in the above equation can be expressed as functions of the expectation 
values G„ and this leads to a new expression for the entropy 



We note that using the expression 



dS 



{G} 



which follows from Eqs.( 2.13 ) and ( |2.17| ) the following relation can be obtained 

d 



dG v 



S(Gl, G n )- 



(2.18) 



(2.19) 



(2.20) 



B. Linear transformations within an observation level 



An observation level can be defined either by a set of linearly independent operators {G u }, or by a set of independent 
linear combinations of the same operators 



G u ^ ^ c avG v . 



(2.21) 



Therefore, a and S are invariant under a linear transformation: 



exp(-EA'G') 



< G '> Trcxp(-EA' M G;) 



'{GL 



(2.22) 



As a result, the Lagrange multipliers transform contravariantly to Eq.(2.21), i.e., 

A^ = ^ t C ^!,A„ , 



(2.23) 



(2.24) 
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C. Extension and reduction of the observation level 



If an observation level O 



expectation values Mi 



{G} 
(Mi). 



G\, . . . , G n is extended by including further operators Mi, . . . , Mi, then additional 
. . , Mi = (Mi) can only increase amount of available information about the state of 



the system. This procedure is called the extension of the observation level (from 0{ G y to 0^ G ^-i) and is associated 
with a decrease of the entropy. More precisely, the entropy Sr G £^ of the extended observation level 0^ G can be 
only smaller or equal to the entropy S^ G ^ of the original observation level 

{G,M} — S{G} ■ 

The generalized canonical density operator of the observation level 0^ G j^-, 

1 



a {G,M} 



exp 



j {gSi} 



i 



with 



Z 



{G,M} 



Tr 



exp 



\yG V 



I N 

K ^ M ^ 



(2.25) 



(2.26) 



(2.27) 



belongs to the set of density operators P{q] fulfilling Eq.(2.S). Therefore, Eq.( 2.26| ) is a special case of Eq.( [2.11 ). 
Analogously to Eqs.(2.13) and (2.15), the Lagrange multipliers can be expressed by functions of the expectation values 



\ v (Gi, . 
: >v(Gi, 



, , G n , Mi , . 
., G n , Mi, 



;Ml), 

.,M). 



(2.28) 



The sign of equality in Eq.(2.25) holds only for = 0. In this special case the expectation values M M are functions 
of the expectation values G v . The measurement of observables M n does not increase information about the system. 
Consequently, p {G A y } = p {G} and S {G ^ } = S {G} . 

We can also consider a reduction of the observation level if we decrease number of independent observables which are 
measured, e.g., 0^ G ^ — » 0{ G }- This reduction is accompanied with an increase of the entropy due to the decrease 
of the information available about the system. 



D. Time dependent entropy of an observation level 



If the dynamical evolution of the system is governed by the evolution superoperator U(t,to), such that p(t) = 
U(t, t n )p(t n ), then expectation values of the operators G v on the given observation level at time t read 



G v {t) = Tr[G v U(t,t )p(t )}. 



(2.29) 



By using these time-dependent expectation values as constraints for maximizing the uncertainty measure r][P{ G y(t)], 
we get the generalized canonical density operator 



'{G} 



(t) 



exp -EA,(*)G t 



Tr 



ex P 



and the time-dependent entropy of the corresponding observation level 



S {G} (t) = -Tr[a {G} (t) In a {G} (*)] = In Z {G} (t) + £ K (t) G v {t) 



(2.30) 



(2.31) 



This generalized canonical density operator does not satisfy the von Neumann equation but it satisfies an integro- 
diffcrential equation derived by Robertson |47| (see also [Q ) . The time-dependent entropy is defined for any system 
being arbitrarily far from equilibrium. In the case of an isolated system the entropy can increase or decrease during 
the time evolution (see, for example Ref. Eo|, Sec. 5.6). 
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III. STATES OF LIGHT: PHASE-SPACE DESCRIPTION 



Utilizing a close analogy between the operator for the electric component E(r,t) of a monochromatic light field 
and the quantum-mechanical harmonic oscillator we will consider a dynamical system which is described by a pair of 
canonically conjugated Hermitean observables q and p, 

[q,p]=ih. (3.1) 

Eigenvalues of these operators range continuously from — oo to +00. The annihilation and creation operators a and 
cv can be expressed as a complex linear combination of q and p: 

' (Xq + iX^p); a f = -^(\q-i\- 1 p), (3.2) 



'2% x ' V2h 

where A is an arbitrary real parameter. The operators a and a' obey the Weyl-Heisenberg commutation relation 

[6,0^ = 1, (3.3) 

and therefore possess the same algebraic properties as the operator associated with the complex amplitude of a har- 
monic oscillator (in this case A = yjmoj, where m and ui are the mass and the frequency of the quantum-mechanical 
oscillator, respectively) or the photon annihilation and creation operators of a single mode of the quantum electro- 
magnetic field. In this case A = y/eotu (eo is the dielectric constant and co is the frequency of the field mode) and the 
operator for the electric field reads (we do not take into account polarization of the field) 

E(r, t) = V2£ Q {ae- lut + a)e lult ) u(r), (3.4) 

where u(r) describes the spatial field distribution and is same in both classical and quantum theories. The constant 
£0 = (hu;/2eoV) 1 / 2 is equal to the "electric field per photon" in the cavity of volume V. 

A particularly useful set of states is the overcomplete set of coherent states \a) which are the eigenstates of the 
annihilation operator a: 

a\a) = a\a). (3-5) 

These coherent states can be generated from the vacuum state |0) [defined as a\0) =0] by the action of the unitary 
displacement operator D{a) [jl9[ 

D(a) ee exp [ao* - a*a] ; |a) = D(a)\0). (3.6) 

The parametric space of eigenvalues, i.e., the phase space for our dynamical system, is the infinite plane of eigenvalues 
(<7,p) of the Hermitean operators q and p. An equivalent phase space is the complex plane of eigenvalues 

' (Xq + iX^p); (3.7) 



2h 

of the annihilation operator a. We should note here that the coherent state \a) is not an eigenstate of either q or p. 
The quantities q and p in Eq.(3.7) can be interpreted as the expectation values of the operators q and p in the state 
I a). Two invariant differential elements of the two phase-spaces are related as: 

-d 2 a = -d[Re(a)} d[Im(a)l = -^—dqdp. (3.8) 

7T 7T 2'Kh 

The phase-space description of the quantum-mechanical oscillator which is in the state described by the density 
operator p (in what follows we will consider mainly pure states such that p = is based on the definition of 

the Wigner function [^0| Wp(^). Here the subscript p in the expression Wp(^) explicitly indicates the state which is 
described by the given Wigner function. 

The Wigner function is related to the characteristic function C p W ^ (r/) of the Weyl-ordered moments of the annihi- 
lation and creation operators of the harmonic oscillator as follows [BB 

W P (0 = \l C [ P (V) exp(£if - Cv) d\ (3.9) 







The characteristic function cj^^ (rj) of the system described by the density operator p is defined as 



Cj%) = Tr[pD(ri)}, 



(3.10) 



where D{rf) is the displacement operator given by Eq.(3.6). The characteristic function C^'irj) can be used for the 



evaluation of the Weyl-ordered products of the annihilation and creation operators: 



n(m+n) 



(3.11) 



On the other hand the mean value of the Weyl-ordered product ({(a') m a"}) can be obtained by using the Wigner 
function directly 



Tr [{(tf) m a n }p] = - / d^iCT'CW^) 



For instance, the Weyl-ordered product ({a'a }) can be evaluated as: 



({a f a 2 }) = -(a f a 2 + da) d + a 2 a f ) 



(3.12) 



(3.13) 



In this paper we will several times refer to mean values of central moments and cumulants of the system operators 
a and a) . We will denote central moments as (■■■)^ and in what follows we will consider the Weyl-ordered central 
moments which are defined as: 



({(at) m a"})W = ({(at - ^))™(a - (a))"}) 



t _ /rf\\ m < 



(3.14) 



From this definition it follows that the central moments of the order k (k = m + n) can be expressed by moments of 
the order less or equal to k. On the other hand we denote cumulants as ((■••))• The cumulants are usually defined via 
characteristic functions. In particular, the Weyl-ordered cumulants arc defined as 



(({(at)™a«})) 



dr ,rn d{ - V *)n lnC P W 



(3.15) 



r/=0 



where C9^(rj) is the characteristic function of the Weyl-ordered moments given by Eq.(3.10). The cumulants of the 



order k (k = m + n) can be expressed in terms moments of the order less or equ al to k. 

Originally the Wigner function was introduced in a form different from (3.9). Namely, the Wigner function was 
defined as a particular Fourier transform of the density operator expressed in the basis of the eigenvectors \q) of the 
position operator q: 



Wp{q, P ) = / dc( q - ami + c/2>e^/ r \ 

J —oo 

which for a pure state described by a state vector \^} (i.e., p = |^)(^|) reads 

Wp(q,p) = / d(i>(q - C/2W(q + C/2)e lpC/ \ 



(3.16) 



(3.17) 



where ip(q) = (qj^). It can be shown that both definitions (3J3) and ( 3.16 ) of the Wigner function are identical (see 
Hillery et al. p0|), providing the parameters £ and £* are related to the coordinates q and p of the phase space as: 



(Xq + iA _1 p) 



c = 



(Xq - iX 1 p) , 



i.e., 



-t(?p -pq ) 



dq 1 dp', 



(3.18) 



(3.19) 
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where the characteristic function Ci is given by the relation 

C p W \q,p)=Tr[pD(q,p) . 

The displacement operator in terms of the position and the momentum operators reads 

i 

D(q,p) = exp -(qp-pq) 
The symmetrically ordered cumulants of the operators q and p can be evaluated as 

«{p m <f}» = hn+m - 



9(-«# lnC ^ ( " ) 



(3.20) 



(3.21) 



(3.22) 



q,p=0 



The Wigner function can be interpreted as the quasiprobability (see below) density distribution through which a 
probability can be expressed to find a quantum-mechanical system (harmonic oscillator) around the "point" (q,p) of 
the phase space. 

With the help of the Wigner function Wp(q,p) the position and momentum probability distributions Wp(q) and 
Wp(p) can be expressed from Wp(q,p) via marginal integration over the conjugated variable (in what follows we assume 
A = 1) 



Wp{q) 



1 



dpW p {q,p) = V2^n(q\p\q}, 



(3.23) 



where \q) is the eigenstate of the position operator q. The marginal probability distribution Wp(q) is normalized to 
unity, i.e., 



1 



'2-Kh 



dqw p (q) = 1. 



(3.24) 



A. Quantum homodyne tomography 



The relation (3.23) for the probability distribution Wp(q) of the position operator q can be generalized to the case 
of the distribution of the rotated quadrature operator xg. This operator is defined as 



(3.25) 



x e = W - [ae- 1 " + aV e 
and the corresponding conjugated operator X0 +7r / 2 j such that [xg, xg +7T /2] — reads 



Xg+yr/2 = [ae~ 



a)e w 



(3.26) 



The position and the momentum operators are related to the oper ator xg as, q = x q and x n /2 — P- The rotation (i.e., 
the linear homogeneous canonical transformation) given by Eqs.(3.25) and (3.26) can be performed by the unitary 
operator U(9): 



so that 



U(9) = exp [-i9a<a] , 
x e = uH9)x U(9); x e+7r/2 = U^(9)x 7t/2 U{9). 



Alternatively, in the vector formalism we can rewrite the transformation (3.28) as 



xg 

x 6+tt/2 



cos # sin t 
— sin 9 cos ( 



(3.27) 
(3.28) 

(3.29) 
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Eigenvalues xg and xg+ir/2 of the operators xg and xg +n / 2 can be expressed in terms of the eigenvalues q and p of 
the position and momentum operators as: 



xg 

x 6+tt/2 



xg 



cos 9 
sin 6 



— sin 9 
cos 9 



(3.30) 



where the matrix F is given by Eq.( |3.29 ) and F 1 is the corresponding inverse matrix. It has been shown by Eker t 
and Knight |52] that Wigner functions are transformed under the action of the linear canonical transformation (3.2£) 



Wp(q,p) — > Wp(F 1 (xg,x e+n / 2 )) = W p (xg cos 9 - x e+7T / 2 sinfl; xg sin 9 + x g+v / 2 cos6 
which means that the probability distribution Wp(xg,9) = V2nh(xg\p\xg) can be evaluated as 

1 f°° 

w p (xg,9)= — / dx g+7V / 2 Wp(xgcos9 - x g+7r/2 sm9;xgsm9 + x g+7r / 2 cos9). 



(3.31) 



(3.32) 



As shown by Vogel and Risken [|9| (see also |12|- |l4| , p3|l ) the knowledge of Wp{xg,9) for all values of 9 (such that 
[0 < 9 < 7r] ) is equivalent to the knowledge of the Wigner function itself. This Wigner function can be obtained from 
the set of distributions Wp(xg, 9) via the inverse Radon transformation: 



W p (q,p) = 



'2nh) 3 / 2 J_ 



dxk 



d£|£| / d9w- p (xg,9)exp 
Jo 



-£,(xg — qcos9 — psint 



(3.33) 



It will be shown later in this paper that the optical homodyne tomography is implicitly based on a measurement of 
all (in principle, infinite number) independent moments (cumulants) of the system operators. Nevertheless, there are 
states for which the Wigner function can be reconstructed much easier than via the homodyne tomography. These are 
Gaussian and generalized Gaussian states which are completely characterized by the first two cumulants of the relevant 
observables while all higher-order cumulants are equal to zero. On the other hand, if the state under consideration 
is characterized by an infinite number of nonzero cumulants then the homodyne tomography can fail because it does 
not provide us with a consistent truncation scheme (see below and [f54f ). As we will show later, the MaxEnt principle 
may help use to reconstruct reliably the Wigner function from incomplete tomographic data. 



1. Quantum tomography via pattern functions 

In a sequence of papers D'Ariano et al. [^4|, Leonhardt et al. |55) and Richter [^6| have shown that Wigner functions 
can be very efficiently reconstructed from tomographic data with the help of the so-called pattern functions. This 
reconstruction procedure is more effective than the usual Radon transformation p5[ . To be specific, D'Ariano et al. 
|l4| have shown that the density matrix p mn in the Fock basisQ can be reconstructed directly from the tomographic 
data, i.e. from the quadrature-amplitude "histograms" (probabilities), w{xg,9) via the so-called direct sampling 
method when 



Pmn 



w(xg,9)F mn (xg,9) dxg d9 : 



(3.34) 



where F mn (xg, 9) is a set of specific sampling functions (see below). Once the density matrix elements are reconstructed 
with the help of Eq.(3.34) then the Wigner function of the corresponding state can be directly obtained using the 
relation 



where W\ m }( n \(q,p) is the Wigner function of the operator |m)(n|. 



(3.35) 



We note that very analogous procedure for a reconstruction of density operators in the quadrature basis has been proposed 
by Kiihn, Welsch and Vogel |53|. 
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A serious problem with the direct sampling method as proposed by D'Ariano et al. E3| is that the sampling functions 
F m n{x9,9) are difficult to compute. Later DAriano, Leonhardt and Paul f35l , j57f have simplified the expression for 
the sampling function and have found that it can be expressed as 

F mn (xe,6) = f mn (x$) exp [i(m - n)9] , (3.36) 

where the so-called pattern function "picks up" the pattern in the quadrature histograms (probability distributions) 
w m n(x0,9) which just match the corresponding density-matrix elements. Recently Leonhardt et al. JL5| have shown 
that the pattern function f m n{xe) can be expressed as derivatives 

fmn{ x ) = ~g^9mn\X)i (3.37) 

of functions g m n{x) which arc given by the Hilbcrt transformation 

9m,n(X) = — / d(, (3.38) 

where V stands for the principal value of the integral and ip n (x) are the real energy eigenfunctions of the harmonic 
oscillator, i.e. the normalizable solutions of the Schrodinger equation 

ft 2 d 2 x 2 \ 

- y^2 + y) Mx) = h(n + l/2)ip n (x), (3.39) 

(we assume m = u> = 1). Further details of possible applications and discussion devoted to numerical procedures of 
the reconstruction of density operators via the direct sampling method can be found in Ref . [CsJ . 

B. States of light to be considered 

In this paper we will consider several quantum-mechanical states of a single-mode light field. In particular, we will 
analyze coherent state, Fock state, squeezed vacuum state, and superpositions of coherent states. 

1. Coherent state 



The coherent state |a) [see Eqs.(3.5-3.6)] is an eigenstate of the annihilation operator a, i.e., |a) is not an eigenstate 



of an observable |49|. The Wigner function [Eq. (p.9|) 1 of the coherent state in the complex £-phase space reads 

Fy| Q) (C) = 2exp(-2|e-a| 2 ); a = a x +ia v , (3.40) 
or alternatively, in the ((/,£>) phase space we have: 



W\ a )(q,p) = — exp 



1 (q~q) 2 1 (p-p) 2 



(3.41) 



where q = v 2ha x j 'A; p — \/2Tia y \, and 



°t = jy and CT P = y- ( 3 - 42 ) 

The mean photon number in the coherent state is equal to n = \a\ 2 . The variances for the position and momentum 
operators are 

(a\(Aq) 2 \a) = Tia 2 ; (a\(Ap) 2 \a) = ticr 2 p , (3.43) 
from which it is seen that the coherent state belongs to the class of the minimum uncertainty states for which 

mf){{Apf) = n^y p = %. (3.44) 
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Using the expression (3.41) for the Wigner function in the (q,p)-phase space we can evaluate the central moments of 
the Weyl-ordered moments of the operators q and p in the coherent state as: 



U/fift\\M -I (2n " 1)!!(2m " m^ q ) n (^ P ) m ; 
m P I) \ . for k = 2n + l or l = 2m + l. 



for k = 2n, I — 2m 



(3.45) 



We see that all central moments of the order higher than second can be expressed in terms of the second-order 
central moments, so we can conclude that the coherent state is completely characterized by four mean values (q); 

(p); (q 2 ), and (p 2 ). With the help of the relation ( |3.20| ) we can find the characteristic function C^\q,p) of the 
symmetrically-ordered moments of the coherent state 



C\^\q,p) = cxp 



2 2 

h qp -h Pq -2h P -2h q 



from which the following nonzero cumulants for the coherent state: 

«$)> = «; {{p))=Pl ((f)) 



H 2 ; 



((p 2 )) 



ha" 2 ,, 



(3.46) 



(3.47) 



can be found. We stress that all other cumulants of the operators q and p are equal to zero. This is due to the fact 
that the characteristic function of the Weyl-ordered moments is an exponential of a polynomial of the second order 
in q and p. 



2. Fock state 



Eigenstates \n) of the photon number operator n 



n = a' a 



2h Kq F > 2' 



(3.48) 



are called the Fock states. The Wigner function of the Fock state |ra) is the £-phase space reads 

W\ n) (0 - 2(-l)" exp (-2|C| 2 ) £ n (m 2 ) , (3-49) 
where C n {x) is the Laguerre polynomial of the order n. In the (q,p) phase space this Wigner function has the form 



W| n) (g,p) = 2(-l) n exp ^ 



q 2 +p 2 



a 2 +p 2 



(3.50) 



The Wigner function ( |3.50 ) does not have a Gaussian form. One can find from Eq.(3.50) the following expressions 
for first few moments of the position and momentum operators: 



(?) 


= (p)=i 


); 


(f) 


= (P 2 ) = 


5(2n + l); 


<? 4 ) 


= (P 4 ) = 


h 2 

— {<on 2 + 6ti + 3) = 
4 


(fP 2 ) 


= (P 2 f) 


h 2 

= —(2n 2 + 2n- 1) 
4 



3 (q 2 ) 2 + (P 2 ) 2 
2 2 
_l (q 2 ) 2 + (p 2 ) 2 
2 2 



(3.51) 



h 2 . 



In addition we find for the characteristic function C^Y\q,p) of the Weyl-ordered moments of the operators q and p 
in the Fock state \n) the expression 

(q 2 +p 2 )] r f(q 2 +p 2 Y 



C\Z\q,p) = exp 



4/? 



2h 



(3.52) 



from which it follows that the Fock state is characterized by an infinite number of nonzero cumulants. On the other 
hand, moments of the photon number operator n in the Fock state \n) are 



(n k ) = n k , 



(3.53) 



from which it follows higher-order moments of the operator n can be expressed in terms of the first-order moment 
and that all central moments (n k )^ are equal to zero. 
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3. Squeezed vacuum state 



The squeezed vacuum state [Q can be expressed in the Fock basis as 

l>7> 



n=0 



(3.54) 



where the squeezing parameter r\ (for simplicity we assume r\ to be real) ranges from —1 to +1. The squeezed vacuum 



state ( |3.54| ) can be obtained by the action of the squeezing operator S(r) on the vacuum state |0) 
\V) =S(r)\0); 



S(r) = exp 



exp 



2 ,a 



f2 



(3.55) 



where the squeezing parameter r € (— oo, +oo) is related to the parameter r\ as follows, rj = tanhr. The mean photon 
number in the squeezed vacuum (3.54) is given by the relation 



V 



(3.56) 



The variances of the position and momentum operators can be expressed in a form (3.43) with the parameters a q and 
a p given by the relations 



1/1 + t) 
2 



1 



1 (I 

2 



n 



i 



(3.57) 



If we assume the squeezing parameter to be real and 77 G [0, —1) then from Eq.(3.57) it follows that fluctuations in 
the momentum are reduced below the vacuum state limit Ti/2 at the expense of increased fluctuations in the position. 
Simultaneously it is important to stress that the product of variances ((Aq) ) and ((Ap) ) is equal to h 2 /4, which 
means that the squeezed vacuum state belongs to the class of the minimum uncertainty states. 
The Wigner function of the squeezed vacuum state is of the Gaussian form 



W M (q,p) = 



1 


f 1 q 2 


1 P 2 


exp 

(JqCTp 


2ho\ 


2k a 2 



(3.58) 



with the parameters c 2 and cr 2 given by Eq.( 3.57 ). From Eq.( 3.58j ) it follows that the mean value of the position 
and the momentum operators in the squeeze d vac uum state are equal to zero, while the higher-order symmetrically- 
ordered (central) moments are given by Eq.( |3.45 ) with the parameters cr 2 and cr 2 given by Eq.( 3.57 ). We see that 
higher-order moments can be expressed in terms of the second-order moments. We can find the expression for the 
characteristic function C^\q,p) for the squeezed vacuum state which reads 



C\™\q,p) = exp 



2 2 

a q 2 a p 2 
-2h P ~2h q 



(3.59) 



from which it directly follows that the squeezed vacuum state is completely characterized by to nonzero cumulants 
((q 2 )) = ha 2 and ((p 2 )) = ha 2 (all other cumulants are equal to zero). 



4- Even and odd coherent states 

In nonlinear optical processes superpositions of coherent states can be produced |59| . In particular, Brune et al. ]6C| l 
have shown that an atomic-phase detection quantum non-demolition scheme can serve for production of superpositions 
of two coherent states of a single-mode radiation field. The following superpositions can be produced via this scheme: 

\a c ) = N^ 2 (\a) + \ - a)) ; N^ 1 = 2 [l + cxp(— 2|a| 2 )] , (3.60) 

and 

\a )=N 1 J 2 (\a)-\-a))- N~ x = 2 [l — exp(— 2|a| 2 )] , (3.61) 
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which are called the even and odd coherent states, respectively. These states have been introduced by Dodonov et 
al. |6l[] in a formal group-theoretical analysis of various subsystems of coherent states. More recently, these states 
have been analyzed as prototypes of superposition states of light which exhibit various nonclassical effects (for the 
review see [j59f ). In particular, quantum interference between component states leads to oscillations in the photon 
number distributions. Another consequence of this interference is a reduction (squeezing) of quadrature fluctuations 
in the even coherent state. On the other hand, the odd coherent state exhibits reduced fluctuations in the photon 
number distribution (sub-Poissonian photon statistics). Nonclassical effects associated with superposition states can 
be explained in terms of quantum interference between the "points" (coherent states) in phase space. The character of 
quantum interference is very sensitive with respect to the relative phase between coherent components of superposition 
states. To illustrate this effect we write down the expressions for the Wigner functions of the even and odd coherent 
states (in what follows we assume a to be real): 

W lae) (q,p) = N e [W\ a) {q,p) + W\_ a) {q,p) + W mt (q,p)] ; (3.62) 



W\ ao) (q,p) = N [W\ a) {q,p) + W\- a) (q,p) - W int {q,p)) 



(3.63) 



where the Wigner funct ions W\+ a \ (q,p) of coherent states | ± a) are given by Eq.( 3.41 ). The interference part of the 
Wigner functions (3.62) and (3.63) is given by the relation 



W int (q,p) 



■ exp 



2ha 2 



2ha 2 



cos 



qp 



(3.64) 



where q — \/2ha (we assume real a) and the variances a 2 and a 2 are given by Eq.( 3.42 ). From Eqs. ( 3.62 )- ( 3.6£ ) 
it follows that the even and odd coherent states differ by a sign of the interference part, which results in completely 
different quantum-statistical properties of thes e states. 

With the help of the Wigner function ( |3.62 ) we evaluate mean values of moments of the operators q and p. The 
first moments are equal to zero, i.e., (q) = (p) = 0, while for higher-order moments we find 



= |fl 



[q- 
(f) =| 

<<z 4 > = * 



(p 4 ) = ZJ t 



8N e a 2 ) ; 
1 - &N e a 2 e- 2a2 
1 + 16N e a 2 (1 
1 - 167V P a 2 e 



2 )] 



(3.65) 



From Eqs. ( 3.65 ) it follows that the even coherent state exhibits the second and fourth-order squeezing in the p- 
quadrature |59||. We do not present explicit expression for higher-order moments, which in general cannot be expressed 
in powers of second-order moments. In terms of the cumulants it means that the even (and odd) coherent states are 
characterized by an infinite number of nonzero cumulants. This can be seen from the expression for the characteristic 
function of the even coherent state which reads 



C^J(q, P ) = 2N e exp 



- a l v 2 



(!) 



exp 



-yy c ° sh ^ 



(3.66) 



IV. OBSERVATION LEVELS FOR SINGLE-MODE FIELD 



In our paper we will consider two different classes of observation levels. Namely, we will consider the phase-sensitive 
and phase-insensitive observation levels. These two classes do differ by the fact that phase-sensitive observation levels 
are related to such operator which provide some information about off-diagonal matrix elements of the density operator 
in the Fock basis (i.e., these observation levels reveal some information about the phase of states under consideration). 
On the contrary, phase-insensitive observation levels are based exclusively on a measurement of diagonal matrix 
elements in the Fock basis. Before we proceed to a detailed description of the phase-sensitive and phase-insensitive 
observation levels we introduce two exceptional observation levels, the complete and thermal observation levels. 



16 



A. Two extreme observation levels 



1. Complete observation level Oa = {(a ) a ; Wk,l} 



The set of operators \n)(m\ (for all n and m) are referred to as complete observation level. Expectation values of 
the operators |n)(ro| are the matrix elements of the density operator in the Fock basis 

(m\p\n) = Tr [p\n) (m\] ; Vn, m, (4-1) 

and therefore the generalized canonical density operator is identical with the statistical density operator 

1 

exp — 

m,n— 



(j = — exp 



(4.2) 



Z = Tr < exp 



- ^2 A m ,„ \n) (m\ 



m,n— 

In this case the entropy So is determined by the density operator p as 

So = -Tr [<7 ln<7 ] = -Tr [plnp] 



This entropy is usually called the von Neumann entropy [|6|j4 
As a consequence of the relation (cf. Sec. 3.3 in p2|) 



= hm y 

E->1 ^ 



k=0 



k\\Jn\ ml 



(at) 



(4.3) 



(4.4) 



(4.5) 



the complete observation level Oq can also be given by a set of operators {(a l ) k a l ; Vfc, 1} or {q k p l ; Vfc, Z}. The Wigner 



function on the complete information level is equal to the Wigner function of the state itself, i.e., Wj- (£) = Wp(^) 



2. Thermal observation level Oth = {er o} 

The total reduction of the complete observation level Oq results in a thermal observation level Oth characterized 
just by one observable, the photon number operator n, i.e., quantum- mechanical states of light on this observation 
level are characterized only by their mean photon number n = (h) . The generalized canonical density operator of this 
observation level is the well-known density operator of the harmonic oscillator in the thermal equilibrium 

o-th = — exp [-Ath«]. (4.6) 

Ah 

To find an explicit expression for the Lagrange multiplier Ath we have to solve the equation 

Tr [a t hfi] = 7i, (4.7) 

from which we find that 

Ath - In , (4.8) 

so that the partition function corresponding to the operator ath reads 

Z t h = {l-exp[- Ath]}" 1 =n+l. (4.9) 
Now we can rewrite the generalized canonical density operator ath m the Fock basis in a form 

oo - n 
n=Q y ' 
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For the entropy of the thermal observation level we find a familiar expression 



Sth = (n+ l)ln(n + 1) - nlnn. (4.11) 

The fact that the entropy 5th is larger than zero for any n > reflects the fact that on the thermal observation 
level all states with the same mean photon number are indistinguishable. This is the reason why Wigner function of 
different states on the thermal information level are identical. The Wigner function of the state \9) on the thermal 
observation level is given by the relation 

2\e 



(th) 



(0 



1 + 2n 



exp 



1 + 2n 



(4.12) 



Extending the thermal observation level we can obtain more "realistic" Wigner functions which in the limit of the 
complete observation level are equal to the Wigner function of the measured state itself, i.e., they are not biased by 
the lack of information (measured data) about the state. 



B. Phase-sensitive observation levels 

1. Observation level 0\ = {a' a, a} , a} 

We can extent the thermal observation level if in addition to the observable n we consider also the measurement 
of mean values of the operators a and fit (that is, we consider a measurement of the observables q and p). If we 
denote the (measured) mean values of this operators as (fi) = 7 and (fit ) = 7* , then the generalized canonical density 
operator <r\ can be written as 

<7i = ^-exp [-Ai(a f - 7*) (a - 7)] , 



with the partition function Zi given by the relation 

Z 1 = (l-e-^) _1 . 
We have chosen the density operator a\ in such form that the conditions 

(fi) = Tr[fi<7i] = 7; (fit) = Tr [fit <jx] = 7*, 
are automatically fulfilled. To see this we rewrite the density operator <j\ in the form: 

b\ = 4- -6(7) exp[-Aia t a]i) t (7), 
Z\ 

where we have used the transformation property D{pf)aD* (7) = fi — 7, and therefore 



Tr[fifii] = — Tr 



7 + — Tr [aexp(— Aifitfi)] 



7- 



L> t (7)aL>(7) exp(-Aifitfi) 

To find the Lagrange multiplier Ai we have to solve the equation Trffitfifii] = fi from which we find 

e _ Al = n-| 7 | 2 
1 + n- I7I 2 

The entropy 5i on the observation level 0\ can be expressed in a form very similar to 5th [see Eq.(l.ll)] 

Si = [n- | 7 | 2 + 1] ln[n - | 7 | 2 + 1] - [n - | 7 | 2 ] ln[n - | 7 | 2 ]- 
The Wigner function W- 1 ^ '(£) corresponding to the generalized canonical density operator &i reads 



l + 2(n-| 7 |2) 



exp 



2|C-7| : 



l + 2(n-| 7 | 2 ) 



(4.13) 

(4.14) 
(4.15) 

(4.16) 

(4.17) 

(4.18) 
(4.19) 

(4.20) 



From the expression (LIE) for the entropy Si it follows that 5i = for those states for which n — \^\ 2 . In fact, there 
is only one state with this property. It is a coherent state \a) ( |3.6| ). In other words, because of the fact that 5i = 0, 
the coherent state can be completely reconstructed on the observation level 0\ . In this case 



= <°>(0 = 2exp [-2|C - a| 2 ] , 



(4.21) 



[see Eq.( 3.40| )]. For other states S\ > and therefore to improve our information about the state we have to perform 
further measurements, i.e., we have to extent the observation level 0\. 
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2. Observation level O2 = {era, (a') ,a ,a , a} 



One of possible extensions of the observation level 0\ can be performed with a help of observables q 2 and p 2 , i.e., 
when not only the mean photon number n and mean values of q and p are known, but also the variances ((AqY) and 
((Ap) 2 ) are measured. On the observation level 2 we can express the generalized canonical operator <r 2 as 

1 



^2 = -z- exp 

^2 



4(« f - 7*) 2 - f (a - 7) 2 - Ai(&t - 7 *)(a - 7 ) 



(4.22) 



where the Lagrange multiplier Ai is real while A2 can be complex: A2 = | A2 |e 16 '■ We can rewrite 02 in a form similar 
to the thermal density operator: 



& 2 = ±D(>y)U(9/2)S(r)exp [- (A 2 - |A 2 | 2 ) 1/2 fit a ] St(r)C^t(fl/2)Dt( 7 ), 
z 2 L J 



(4.23) 



where the operators D{y), U(9/2), and 5(r) are given by Eqs.(3.6), ( 3.27 ), and ( 3.55 ), respectively. These operators 
transform the annihilation operator a as: 



L» t ( 7 )aL)(7) = a + 7; 



tP(9/2)aU(6/2) 



-80/2. 



S 1 ^ (r)aS'(r) = a cosh r + sinh r. 
The partition function .2T 2 in Eq.( 4.23| ) can be evaluated in an explicit form: 

Z 2 - 1 = l-exp[-(A 2 -|A 2 | 2 )^]. 



(4.24) 



(4.25) 



I n Eq .(4.23) we have chosen the parameter r to be given by the relation tanh2r = — |A 2 |/Ai. The density operator 
( 4.23 ) is defined in such way that it automatically fulfills the condition Tr[ao" 2 ] = 7, while the Lagrange multipliers 
Ai and A 2 have to be found from the relations Tr[a^aa 2 ] = n and Tr[a 2 <r 2 ] = /i: 



Tr[a f aCT 2 ] = n = | 7 | 2 - 1/2 + (x + 1/2) cosh2r; 
Tr[a 2 (7 2 ] = fx = 7 2 + c^ 6 ( X + 1/2) sinh2r, 



where we have used the notation 



{exp[(A? - |A 2 | 2 ) 1/2 ] - I}"' 



(4.26) 



(4.27) 



Instead of finding explicit expressions for the Lagrange multipliers Ai and A 2 we can find solutions for the parameters 
tanh2r and \- We express these parameters in terms of the measured central moments (aJa)^ = N = n — | 7 | 2 > 
and (a 2 )^ = M = \M\e~ ie = 



H-j 2 : 



tanh 2r 



\M\ 



N + 1/2' 



(4.28) 



X = [(iV + l/2) 2 -|M| 2 ] 1/2 -l/2. (4.29) 
We remind us that physical requirements |53| lead to the following restrictions on the parameters N and M: 

N > 0; iV(iV + 1) > \M | 2 . (4.30) 

(2) 

Once the tanh2r and \ are found we can reconstruct the Wigner function W~ (£) on the observation level C 2 . 
This Wigner function reads: 



Wf(0 = 



1 



■ exp 



(N + 1/2)|£ - 7 | 2 - ~ if ~ ~ 7*) 5 



[(N + 1/2) 2 - |M| 2 ] 1/2 
Analogously we can find an expression for the entropy 5 2 : 



[(N+ 1/2) 2 -\M\ 2 



(4.31) 
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S 2 = (x + l)ln(x + l)-xbiX- 



(4.32) 



It has a form of the thermal entropy ( 4.11 ) with a mea n thermal-photon number equal to x [ see Eq.( 4.29f )T. 

Using the expression for the Wigner function (4.31) we can rewrite the variances of the position and momentum 
operators in terms of the parameters N and M as follows 



((A<r 



•2JV + 2ReAf]; ((ApY 



The product of these variances reads: 



2^ 



■2N-2ReM). 



(4.33) 



<(A«j) 2 }((Ap) 2 } = — [(1 + 27V) 2 - 4(ReM) 2 ] 



(4.34) 



From the expression ( 4.32| ) for the entropy S2 it is seen that those states for which N(N+ 1) = |M| 2 can be completely 
reconstructed of the observation level O2, because for these states S% = 0. In fact, it has been shown by Dodonov et 
al. |64|] that the states for which N(N + 1) = |M| 2 are the only pure states which have non-negative Wigner functions 
[see Eq.(4.31)]. For these states the product of variances (4.34) reads 



((Aqf)((Ap) 2 ) 



[l + 4(ImM) 2 



(4.35) 



which means that if in addition ImAf = (see for instance squeezed vacuum state with real parameter of squeezing) 
then these states also belong to the class of the minimum uncertainty states. From our previous discussion it follows 
that the squeezed vacuum as well as squeezed coherent states can be completely reconstructed on the observation level 
02- More generally, we can say that all Gaussian states for which N(N + 1) = \M\ 2 can be completely reconstructed 
on this observation level. 



3. Higher-order phase-sensitive observation levels 

There are pure non-Gaussian states (such as the even coherent state) for which the entropy S% is larger than zero 
and therefore in order to reconstruct Wigner functions of such states more precisely, we have to extent the observation 
level 02- Straightforward extension of O2 is the observation level Ok = {(a') m a n ; Vm, n; m + n < k}, which in the 
limit k — > 00 is extended to the complete observation level. 

To perform a reconstruction of the Wigner function on the observation level Ok with k > 2 an attention has to 
be paid to the fact that for a certain choice of possible observables the vacuum-to-vacuum matrix elements of the 
generalized canonical density operator (0| cxfc |0) can have divergent Taylor-series expansion. To be more specific, if we 
consider an observation level such that Ok = {{a^) k ,a k } then for the generalized canonical density operator 

a k = i- exp [-X k (a)) k - X k a k ] , (4.36) 

the corresponding partition function Zk — Trexp [— Xk(a^) k — A£a fe ] is divergent p5| . This means that one cannot 
consistently define an observation level based exclusively on the measurement of the operators (a^) fe and a k . In 
general, to "regularize" the problem one has to include the photon number operator n into the observation level. 
Then the generalized density operator &k 

a k = i- exp [-X a^a - X k (^) k - X* k a k ] , (4.37) 

can be properly defined and one may reconstruct the corresponding Wigner function Wk{Ct- We note that any 
observation has to be chosen in such a way that information about the mean photon number is available, i.e., 
knowledge of the mean photon number (the mean energy) of the system under consideration represents a necessary 
condition for a reconstruction of the Wigner function. 
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C. Phase-insensitive observation levels 



The choice of the observation level is very important in order to optimize the strategy for the measurement and 
the reconstruction of the Wigner function of a given quantum-mechanical state of light. For instance, if we would 
like to reconstruct the Wigner function of the Fock state \n) at the observation level Ok = {a^a, (a^) m a n ; to + n < 
k and to ^ n} we find that irrespectively on the number (k) of "measured" mom ents {{a)) m a n ) (for m 7^ n) the 
reconstructed Wigner function is always equal to the thermal Wigner function ( 4.12 ) . So it can happen that in a very 
tedious experiment negligible information is obtained. On the other hand, if a measurement of diagonal elements of 
the density operator in the Fock basis is performed relevant information can be obtained much easier. 



1. Observation level Oa = {Pn = \n){n\; Vn} 



The most general phase- insensitive observation level corresponds to the case when all diagonal elements P n = (n\p\n) 
of the density operator p describing the state under consideration are measured. The observation level Oa can be 
obtained via a reduction of the complete observation level Oo and it correspo nds to the measurement of the photon 
number distribution P n such that J2 n = !• Because of the relation [see Eq.(|4.5|)] 



= lim > 



k=0 



(—e) k nl 
k\n\ v ' e— >i ^— f k\n\ (n — k — n)V 



(aj) k+n a k+n = lim^- 



(4.38) 



fc=0 



we can conclude that the observation level Oa corresponds to the measurement of all moments of the creation and 
annihilation operators of the form (a*) k a k or, what is the same, it corresponds to a measurement of all moments of 
the photon number operator, i.e., 



A = {P n = \n)(n\; Vn} = {(a t ) fe a fe ; V k} = {n k ; Vfc}. 
The generalized canonical operator a\ at the observation level Oa reads 



o^ = — exp 



n=0 



with the partition function given by the relation 



n=0 



= ^exp[-A„]. 



n=0 



Za = Tr ^ exp 

The entropy Sa at the observation level Oa can be expressed in the form 

00 

Sa = In Za + 2J A n -Pn- 

n=0 

The Lagrange multipliers A„ have to be evaluated from an infinite set of equations: 



P n = Tr[a A P n ] = 



Za 



Vn, 



(4.39) 



(4.40) 



(4.41) 



(4.42) 



(4.43) 



from which we find X n = — ln[ZA-P n ]. If we insert A n into expression ( 4.42 ) we obtain for the entropy 5a the familiar 
expression 



derived by Shannon ((3(| . Here it should be briefly noted that as a consequence of the relation 

00 

£4 = i, 



(4.44) 



(4.45) 



n=0 
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the operators P n are not linearly independent, which means that the Lagrange multipliers A„ and the partition 
function Za are not uniquely defined. Nevertheless, if Za is chosen to be equal to unity, then the Lagrange multipliers 
can be expressed as 



A n — — In P n ; 



(4.46) 



and the generalized canonical density operator reads 



" \ = X! p n\n)(n\; 



Pn = 1. 



(4.47) 



71 = 



From here it follows that the Wigner function W~ A \%) of the state p at the observation level Oa can be reconstructed 
in the form 



oo 



(4.48) 



where W\ n )(£) is the Wigner function of the Fock state \n) given by Eq.( 3.49| ). 

The phase-insensitive observation level Oa can be further reduced if only a finite number of operators P n [where 
n € M] is considered. In this case, in general, we have YlneM P n < 1 and therefore it is essential that apart of mean 
values P n also the mean photon number n is known from the measurement. 



2. Observation level Ob = {n,p2 n = |2n)(2n|; Vn} 

As an example of the observation level which is reduced with respect to Oa we can consider the observation level 
Ob which is based on a measurement of the average photon number n and on the photon statistics on the subspace 
of the Fock space composed of the even Fock states \2n). In this case the generalized canonical density operator <tb 
can be written as 



1 

a B = — exp 



-An - ^ X„P2r. 



71=0 



-Xft 



Zb 



71=0 



71 = 



where the partition function is given by the relation 



Zb = Tr < exp 



-An - A„A 



71=0 



(4.49) 



(4.50) 



This partition function can be explicitly evaluated with the help solutions for the Lagrange multipliers from equations 
Tr[P27i<7B] = P2n- If we introduce the notation 



Podd = 1 — Pin ; 



(4.51) 



71=0 



n dd = n 



(4.52) 



then the partition function Zb can be expressed as 



r_2 p2 ] I/ 2 



IP: 



del 



Analogously we find for the generalized canonical density operator the expression 

oo oo 

(Tb =^P 2n |2n)(2n|+X]P2 n+ i|2n+l)<2n + l|, 



(4.53) 



(4.54) 



71=0 



71=0 
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where Pm are measured values of Pi n and P2n+i are evaluated from the MaxEnt principle: 

n dd — Podd 



P 



2n+l 



^ r odd 

flodd + Podd V ™odd + -Podd 



(4.55) 



From Eq.(|4~55|) we see that on the subspace of odd Fock states we have obtained from the MaxEn t pri nciple a 
"thermal-like" photon number distribution. Now, we know all values of Pm and P2K+1 and using Eq.(4.44) we can 

easily evaluate the entropy 5b and the Wigner function W~ ' (£) on the observation level Ob [see Eq.(4.48)]. 



3. Observation level Oc = {n, P 2 n+i = |2n + l)(2ra + 1|; Vn} 

If the mean photon number and the probabilities Pm+\ = (2n + l|p|2n 4- 1) are known, then we can define an 
observation level Oc which in a sense is a complementary observation level to Ob- After some algebra one can find 
for the generalized canonical density operator &c the expression equivalent to Eq.(4.54), i.e., 



<7 C = p 2n|2n) {2n\ + J2 p 2«+i \2n + l)(2n + 1| 



n=0 n=0 

where the parameters Pm+i are known from measurement and Pi n are evaluated as follows 



P2n 



2Pl 



+ 2Pe 



even \ "'even 



+ 2P e . 



In Eq.(4.57) we have introduced notations 



Peven — 1 ^ ^ P^n+li 



(4.56) 



(4.57) 



(4.58) 



71=0 



.-^(2n+l)P 2 „+i. 



n=0 



The explicit expression for the partition function Zq is 

Z C 



2P 2 



The reconstruction of the Wigner function W\ (£) is now straightforward [see Eq.(4.4 



(4.59) 



(4.60) 



4. Observation level On = {n,P N = |jV){iV|} 

We can reduce observation levels 0a,b,c even further and we can consider only a measurement of the mean photon 
number n and a probability P/v to find the system under consideration in the Fock state \N). The generalized density 
operator <td in this case reads 



ct d = — exp 



—An — XnPn 



(4.61) 



Taking into account the fact that the observables under consideration do commute, i.e., [n, py] = 0, and that the 

as 



operator py is a projector (i.e., Pjy = py) we can rewrite Eq.(4.6l 



- An 



CD 



Zu 



N) 



e _Ajv Pv 



P N \N)(N\ + p nH(r 



(4.62) 
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where A and Xn are Lagrange multipliers associated with operators n and P/v, respectively, and P n = exp(— An)/Zn 

gives the photon number distribution on the subspace of the Fock space without the vector \N). The generalized 
partition function can be expressed as 

Z u = T ^— + x N (y-l), (4.63) 
1 — x 

where we have introduced notation 

x — exp(— A); y — exp(— Ajv). (4.64) 
The Lagrange multipliers can be found from equations 



J_ N _ (1 - x)x N y 
Zu X V ~ l + x«(y~l){l~xY 



P N = —x y = - ; -; (4.65) 



1 



{l-xf 



+ Nx N (y- 1) 



x + Nx N (l -x) 2 (y- 1) 
(l-x)[l + x N (y-l)(l~x)Y 



(4.66) 



Generally, we cannot expre ss the Lagr ange multipliers A and A at as functions of n and Pjv in an analytical way for 
arbitrary N and Eqs.( 4.65 ) and ( 4.66| ) have to be solved numerically. Nevertheless, there are two cases when these 
equations can be solved in a closed analytical form. 

1. If N = (we will denote this observation level as Odi), then we can find for Lagrange multipliers A and Ao 
following expressions: 



c~ A = 1 - 
and for the partition function we find 



1-Pn 



C -A 







(l-Po) 2 
u-(l-Po) 



[n-(l-Po)]; 



Then after some straightforward algebra we can evaluate the parameters P n as 



Pn = 




for n = 0; 

n-(l-Po) 



for n > 0. 



(4.67) 



(4.68) 



(4.69) 



From Eq.( 4.69| ) which describes the photon number distribution obtained from the generalized density operator <tdi 
it follows that the reconstructed state on the observation level Odi has on the subspace formed of Fock states except 
the vacuum a thermal-like character. Nevertheless, in this case the reconstructed Wigner function can be negative 
(unlike in the case of the thermal observation level). This c an happen if Pq is close to zero and n is small. Using 
explicit expressions for the parameters P n given by Eq.(4.69) we can evaluate the entropy Sdi corresponding to the 
present observation level: 



-P lnP - (n-P) ln(n-P) -2PlnP + nlnn, 



where we have used notation P = 1 — P . In the limit P — > (1 + n) 1 expression (4.70) reads 



lim 5di 



(n + 1) ln(n + 1) — n In n, 



(4.70) 



(4.71) 



which is the entropy on the thermal observation level Eq.(4.11). In this limit the Q ui reduces to the thermal 
observation level Oth ■ On the other hand, in the limit Pq — ► wc obtain from Eq.(4.70) 



lim Sbi 

p — o 



nlnn — [n — 1) ln(n — 1), 



(4.72) 



from which it directl y foll ows that in this case the mean photon number has necessary to be larger or equal than unity. 
Moreover, from Eq.(4.72) we see that in the limit n — ► 1 the entropy 5di = which means that the Fock state |1) 
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can be completely reconstructed on the o bservation level 0di- This fact can also be seen from an explicit expression 
for the photon number distribution ( I.6S ) from which it follows that 



lim lim P n 



>n,l- 



(4.73) 



2. If the mean photon number is an integer, then in the case N = n (we will denote this observation level as 0D2) 
we find for the Lagrange multipliers A and Xn=u = the expressions 



l + n 



e _ Afl = (l + n) 



(1 - P fl )n« 



and for the partition function we find 



Zt>2 = 



(l + n) 



l+n 



(l-P B )(l+n) a 

Taking into account the expression for the reconstructed photon number distribution 



Pn = (n <7 D 2 n) = 



^D2 



[l + S n ,n (c 



1)]. 



then with the help of relations (4.74) and (4.75) we find 

f Pn 



n , n = n 

Pn = ( (l-Pn)(l+n)« ( n Y . , 

[ (i+ R )i+fl_fifl \T+li) > n ^ 



(4.74) 



(4.75) 



(4.76) 



(4.77) 



We see that the reconstructed photon-number distribution has a thermal-like character. The corresponding entropy 
can be evaluated in a closed analytical form 



Sd2 



-P n InP a - (1 - P n ) ln(l - Pn) + (1 - Pn) In 



(l + n) 



l+n 



1 



It is interesting to note that if P fi is given by its value in the thermal photon number distribution, i.e., 



(l + n) 



l+n ' 



then the entropy ( |4.78|) reduces to 



5"d2 = (n + 1) ln(n + 1) — n Inn = — \nPn, 



(4.78) 



(4.79) 



(4.80) 



which means that the reconstructed density operator <td2 on t ne observation level Od2 with P„ given by Eq. ( [l.79[ ) is 
equal to the density operator of the thermal field [see Eq.(4.10)] and so, in this case the reduction Od2 — * Cth takes 
place. Obviously, if Pn = 1 then Sx>2 = an d the Fock state fn) can be completely reconstructed on the observation 
level 0D2- 



D. Relations between observation levels 



Various observation levels considered in this section can be obtained as a result of a sequence of mutual reductions. 
Therefore we can order observation levels under consideration. This ordering can be done separately for phase-sensitive 
and phase-insensitive observation levels. In particular, phase-sensitive observation levels are ordered as follows: 

O D 02 3 0i D Oth. (4-81) 

The corresponding entropies are related as 

So < S 2 < Si < S th . (4.82) 

The ordering of phase-insensitive observation levels 0a, 0b , 0c j 0di and 0D2 is more complex. In particular, we 
find 
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On : O a d {% B \- O lU : 



c 



Oo D O a D \ ° D1 \ D O th , (4.83) 



o D2 

O q dO a dO b d O m D O th , 

which reflects the fact that observation levels Ob and Oc (as well as Odi and Od2) cannot be obtained as a result of 
mutual reduction or extension. The corresponding entropies are related as 

So < Sa < | ^ | < S t h', 

So < S A < | | < 5 th , (4.84) 
So < Sa < 5b < Sbi < •S'th- 

For a particular quantum-mechanical state of light observation levels Ox can be ordered with respect to increasing 
values of entropies Sx- From the above it also follows that if the entropy Sx on the observation level Ox is equal 
to zero, then the entropies on the extended observation levels are equal to zero as well. It this case the complete 
reconstruction of the Wigner function of a pure state can be performed on the observation level which is based on a 
measurement of a finite number of observables. 

V. RECONSTRUCTION OF WIGNER FUNCTIONS 
A. Coherent states 



The Wigner function Wj a \(^) of a coherent state \a) on the complete observation level is given by Eq.(3.4C) 
[see Fig.|l|a]. Coherent states are uniquely characterized by their amplitude and phase and therefore phase-sensitive 



observation levels have to be considered for a proper reconstruction of their Wigner functions. In Section IV A 1 
have shown that the Wigner function of coherent states can be completely reconstructed on the observation level 0\ 
(see Figjjja) . Nevertheless it is interesting to understand how Wigner functions of coherent states can be reconstructed 
on phase-insensitive observation levels. 

1. Observation level Oa 

The coherent state \a) has a Poissonian photon number distribution and therefore we obtain for the generalized 
density operator of the coherent state on Oa the expression 

a A = Y,Pn\n)(n\; P n = e -I Q I 2 ^— . (5.1) 

71=0 



This density operator describes a phase-diffused coherent state. Eq.(5.1) can be rewritten in the coherent-state basis 



&A = h «=l«|e # - (5.2) 



From Eqs.(5.1) and (5.2) it follows that on the observation level Oa phase information is completely lost and the 



corresponding Wigner function can be written as 

00 { I 1 2 ^ n 

= 2exp(-2|C| 2 - H 2 ) £ ^f^£„(4|e| 2 ), (5.3) 

or after some algebra we can find 

A '\ a ) 



71 = 



W$(0 = 2exp(-2|£| 2 - 2|a| 2 ) J (4z|a| |£|), (5.4) 



2G 



where Jo(4i|ev| |£|) is the Bessel function 



M^W\ |£|) = £ 



(4H 2 ici 2 r 

\)2 



n=0 



(n!) 



(5.5) 



(a) fi o =Ci = 8 




(b) 



h 1 , 






(h) O n 




FIG. 1. The reconstructed Wigner functions of the coherent state |q) with n = 2. We consider the observation levels as 
indicated in the figure. It is clearly seen that the Wigner function of a coherent state can be easily reconstructed on the most 
simple phase-sensitive observation level 0\ . We note that on some observation levels the reconstructed Wigner function of the 
coherent state may take negative values. 



from which we see that the Wigner function (5.4) is positive. We plot in Fig. [pb. We can understand the 



shape of W\?{£) if we imagine phase-averaging of the Wigner function Wi a \(£) [see Fig. [pa]. On the other hand 



we can represent (£) a s a sum of weighted Wigner functions of Fock states [see Eq.( |5.3| )]. For the considered 
coherent state \a) with the mean photon number n = 2 we have P\ = P2 = 2Po — 2exp(— 2), so the Wigner functions 
of Fock states |1) and |2) dominantly contribute to On the other hand contribution of the Wigner function 

of the vacuum state is suppressed and therefore (£) has a local minimum around the origin of the phase space 
while its maximum is at the same distance from the origin of the phase space as for the Wigner function on the 
complete observation level [see Fig. [pa]. We note that the Wigner function (£) describing the phase-diffused 

coherent state has been experimentally reconstructed recently by Munroe et al. fill]. 
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2. Observation level Ob 



tion 



L et us a ssume that from a measurement the mean photon numbe r n and probabilities P2n are know (see Sec- 
IV C 2). If the values of P2„ are given by Poissonian distribution (5.1), i.e., Pm = exp(— n)n 2n / (2n)l, then using 



definitions (4.51) and (4.52) we can find the parameters P 0( id and n dd to be 



Podd = e " sinh 



n; 



n Q dd = n(l - P dd), 



The reconstructed probabilities p2 n +i are given by Eq.(4.55) and in the limit of large n (when P oc id 
n Q dd — > n/2) they read 



P2n 



+1 



(n- l) n 



(5.6) 
1/2 and 



(5.7) 



With the help of the relation ( 4.48 ) and explicit expressions for P2n and Pm+\ we can evaluate expression for the 
Wigner function of the coherent state on the observation level Ob- We plot W| a \ (0 of the coherent state with the 
mean photon number equal to two (n — 2) in Fig. |l|c. In this case P2 is dominant from which it follows that the Fock 
state 1 2) gives a significant contribution into [compare with Fig. [|b]. 



3. Observation level Oc 

r(C), 



The Wigner function Wj^ (£) of the coherent state on the observation level Oc can be reconstructed in exactly 
same way as on the level Ob- In Fig. |l|d we present a result of this reconstruction. On the observation level Oc 
the contribution of the vacuum state is more significant than in the case Ob whic h is d ue to the thermal-like photon 
number distribution Pan on the even- number subspace of the Fock space [see Eq.(4.57)]. 



4. Observation level Or 



We can easily reconstruct the Wigner function of the coherent state at the observation level Odi- Using general 
expressions from Section IV C 4 we find the following expression for the Wigner function WffliO [we remind us that 
for coherent state the parameter Pq is given by the relation Pq = cxp(— n) 



Po 



1 — -Pi \ , , , ,n 

' * W| >(£) + (l-Po) 



1 



(5.8) 



where Wo)(0 is th e Wigner function of the vacuum state given by Eq.(3.40) and Wth(£) is the Wigner function of a 
thermal state ( 4.12 ) with an effective number of photons equal to n, where 



1-Po 



In particular, from Eqs.(5.S) and (5.9) it follows that 



(5.9) 



(5.10) 



and simultaneously Sbi = 0, which means that the vacuum st ate can be completely reconstructed on the present 
observation level. Another result which can be derived from Eq.(5.8) is that if Po(2n + 1) < 1, then the reconstructed 



Wigner function (£) of the coherent state \a) can be negative due to the fact that the contribution of the Fock 

state |1) is more dominant than the contribution of the vacuum state and then the negativity of the Wigner function 
W\i) (£) results into negative values of W^\^). This means that even though the Wigner function of the state itself 
(i.e., the Wigner function at the complete observation level) is positive, the reconstructed Wigner function can be 
negative. This is a clear indication that the observation level has to be chosen very carefully and that reconstructed 
Wigner functions can indicate nonclassical behavior even in those cases when the measured state itself does not exhibit 
nonclassical effects. In Fig. [ljg we plot the Wigner function W 7 ^ 1 (£) of the coherent state which illustrates this effect. 
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5. Observation level Od2 



If the mean photon number n is an integer, then one may consider the observation level 0D2- The Wigner function 
of the coherent state at this observation level for which P n = exp(— n)n n /(n\) reads 



i 



z 



D2 



"V |fl> (0 " 



1 



z 



D2 



(5.11) 



where W\n) (£) is the Wigner function of the Fock state \n) and Wth(C) i s the Wigner functi on of the thermal state with 



the mean photon number equal to n. The partition function Zrj2 is given by the relation (4.75). The Wigner function 
(5.11) is plotted in Fig. [l]f. From this figure we see that the vacuum state |0) (due to the thermal-like character of the 
reconstructed photon number distribution) and the Fock state \2) (as a consequence of the measurement) dominantly 

contribute to W{% 1) (£). 



B. Squeezed vacuum 



The Wigner function of the squeezed vacuum state ( 3.54| ) on the complete observation level 0o is given by Eq.( p.5q ) 
and is plotted (in the complex £ phase space) in Fig. ^a. This is a Gaussian function, which carries phase information 
associated with the phase of squeezing. On the thermal observation level 0th which is characterized only by the 
mean photon number n the reconstructed Wigner function of the squee zed v acuum state is a rotationally symmetrical 
Gaussian function centered at the origin of the phase space [see Eq.( |4.12j ) and Fig. [ljg]. On the observation level 
0i the reconstructed Wigner function is the same as on the thermal observation level because the mean amplitudes 
(a) and (a') are equal to zero. On the other hand, the Wigner function of the squeezed vacuum can be completely 
reco nstru cted on the observation level 02 . To see this we evaluate the entropy S2 for the squeezed vacuum state [see 
Eq.(4.32)]. The parameters M and N can be expressed in terms of the squeezing parameter rj (we assume r\ to be 
real) as 



N = 



M = 



1 - V 2 



(5.12) 



so that N(N + 1) = M 2 . Consequently the parameter \ given by Eq.( 4.29| ) is equal to zero from which it follows that 
S2 for the squeezed vacuum is equal to zero. 



1. Observation level Oa 



The squeezed vacuum state (3.54) is characterized by the oscillatory photon number distribution P n : 

: (2n) 
[2 n n{ 



P — n „2U/2 ( 2n )' J2n. p _ n 

t\n ~ (1 - V ) f 9 » ,12 ^ . - 0. 



(5.13) 



Using Eq.(4.48) we can express the Wigner function w{^\{;) of the squeezed vacuum on the observation level 0a as 



= 2(1 -^V^X: 



(2n)\r] 2n 
2 2n (n!) 2 



£ 2n (4|£l 2 )- 



(5.14) 



Taking into account that the Wigner function on the observation level Oa can be obtained as the phase-averaged 
Wigner function on the complete observation level, we can rewrite (5.14) as 



,i<t> 



If we insert the explicit expression for W\ v )(£) [see Eq.( 3.5S| )] into Eq.(5.15) we obtain 



Wf^(£) = 2exp 



2a 2 + 2a 2 



2 \ 1 



(5.15) 



(5.16) 
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FIG. 2. The reconstructed Wigner functions of the squeezed vacuum state |7j) with n = 2. We consider the observation 
levels as indicated in the figure. This Wigner function can be completely reconstructed on the observation level 02- 

where Iq(x) is the modified Bessel function. We plot this Wigner function in Fig. |^b. We see that W!ff(£) is not 
negative and that it is much narrower in the vicinity of the origin of the phase space than the Wigner function of the 
vacuum state (compare with Fig. |l]a). Nevertheless the total width of Wigner function (£) is much larger than 
the width of the Wigner function of the vacuum state. 



2. Observation level Oe 



Due to the fact that for the squeezed vacuum state we have Pm — 1, the Wigner function of the squeezed vacuum 



state on the observation level Ob is equal to the Wigner function on the observation level 0a, i.e., (£) = (£) 



3. Observation level Oc 

For the squeezed vacuum state all meanvalues i-2n+i are equal to zero and therefore ^ n f2n+i = 0. From this fact 
and from the knowledge of the mean photon number h we can reconstruct the Wigner function (£) in the form 



[see Section [VB3] 



4e" 2 m 2 



A' 



<) ) (0 = ^ T fE(^) (5.17) 
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where n is the mean photon number in the squeezed vacuum state. We plot the Wigner function in Fig. |c. 

This Wigner function is very similar to the Wigner function on the observation level Oa [see Fig. gb] which reflects 
the fact that the photon number distribution of the squeezed vacuum state has a thermal-like character on the 
even-number subspace of the Fock space. 



4- Observation level Odi 

With the help of the general formalism presented in Section IV we can express the Wigner function W^iO of 
the squeezed vacuum state on the observation level Odi in the form [see Eq.(5.8)] with 



P Q = (1 -rj 2 ) 1/2 = (n + I) -172 ; and n 



1 - (l + n)-V2 



1. 



(5.18) 



We plot the Wigner function WVp{£) in Fig. |d from which the dominant contribution of the vacuum state is 
transparent which is due to the fact that the squeezed vacuum state has a thermal-like photon number distribution. 



5. Observation level O02 



If we consider n to be an even integer, then the Wigner function WiS 2 \(,) of the squeezed vacuum state on Orj2 is 
given by Eq. ( pMl| ) . The partition function Zd2 is given by Eq. ( 4.75 ) where 



,r>/2 



2™ [(n/2)!] 2 (l+n)^™)/ 2 



(5.19) 



We plot this Wigner function in Fig. ||e. It has a thermal-like character [compare with Fig. |l]g] but contribution of 
the Fock state \n = 2) is more dominant compared with the proper thermal distribution. If n is an od d inte ger, t hen 
Pn = and the corresponding Wigner function can be again reconstructed with the help of Eqs.(5.11) and (4.75). 



C. Even coherent state 

We plot the Wigner function of the even coherent state on the complete observation level in Fig. ||a. Two contri- 
butions of coherent component state \a) and | — a) as well as the interference peak around the origin of the phase 
space are transparent in this figure. As in the case of the squeezed vacuum state, the mean amplitude (a) of the 
even coherent state is equal to zero and therefore the Wigner function wj^\ ( £) of the even coherent state on the 
observation level 0\ is equal to the thermal Wigner function given by Eq.( 4.12| ). 



Observation level O2 



(2) 

Using general expressions from Section [IV A2| we can express the Wigner function W^ a ^ (£) of the even coherent 
state on the observation level O2 as 



1 



[(TV + 1/2) 2 -M 2 ] 



1/2 



exp 



£ 2 
s X 



[(N + 1/2) + M] [(N + 1/2) - M] 



(5.20) 



where £ = £ x + i£ y , and the parameters N and M read 



N = a z tanh a z 



M = a 1 



(5.21) 



We plot the Wigner function W^f ^ (£) in Fig. ||b. This Wigner function is slightly "squeezed" in the ^-direction and 
stretched in the ^-direction. Nevertheless, the reconstructed Wigner function is different from the Wigner function 
of the squeezed vacuum state [compare with Fig. ^|a] . 
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FIG. 3. The reconstructed Wigner functions of the even coherent state \a e ) with n — 2. We consider the observation 
levels as indicated in the figure. This non-Gaussian Wigner function can be completely reconstructed only on the complete 
observation level when all moments of systems operators are measured. 

2. Observation level Oa 

The photon number distribution of the even coherent state is given by the relation (we assume a to be real) : 
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P2n = r 2 P 2 n+1 = 0, (5.22) 

cosher (2n)! 



so the corresponding Wigner function can be expressed as Eq.([L4£). We can also express wf£\(£) as the phase 



averaged Wigner function of the even coherent state W| Q( ,)(£) given by Eq.( 3.62| ). After some algebra we find that 
W,^(£) can be written in a closed form 

<?)(£) = ^hrf [e- Q V (4^|e|) + e«V (4«|C|)] . (5.23) 

We plot the Wigner function W^\(£) in Fig. From this figure the dominant contribution of the Fock state |2) is 

transparent (in the present case we have Po ~ 2exp(— 2), P^ — 2Po, and P4 — 2Pq/3, while all other probabilities P n 
are much smaller) which results in negative Wigner function. 

3. Observation level Ob 



Due to the fact that the even coherent state is expressed as a superposition of only even Fock states, i.e., P^r, 

S(o=< ) > ( 



the Wigner functions on the observation levels Oa and Ob are equal, i.e., Wf£l(£) = W\ a 



4- Observation level Oc 



As a consequence of the fact that for the even coherent state all meanvalues P-m+i are equal to zero the information 

'(C) 

l«e>' 



available for the reconstruction of the Wigner function W, \ (£) is the same as in the case of the reconstruction of the 



Wigner function of the squeezed vacuum state on the observation level Oc- Therefore, the Wigner function Wf-^ (£) 
has exactly the same form as for the squeezed vacuum state with the same mean photon number n [see Fig. 0d and 
Fig.|c]. 



5. Observation level Odi 



The Wigner function W^l (£) of the even coherent state on the observation level Odi is given by Eq.(5.8) with 



° cosh a 2 ' ~ cosh a 2 — 1 ^ 

We plot the Wigner function w{-°? (£) in Fig. |e. This Wi gner function has a thermal-like character except the fact 
that the contribution of the vacuum state is slightly suppressed. 

6. Observation level Ou2 

Analogously we can find the Wigner function {£)• ^ we consider n to be an even integer, then the Wigner 



function WP? 2) (£) of the even coherent state on 0rj 2 is given by Eq.Q5.1l|) and Eq.([4.75|) where 



1 a m 

Pn = T 2 ^T' ( 5 - 25 ) 

cosher n! 



and if n is an odd integer then P^ = 0. We plot W^\C) in Fig. |f. From our previous discussion it is clear that 

in the present case the vacuum sta 
observation level 0\ - see Fig. ||c) . 



in the present case the vacuum state and the Fock state |2) dominantly contribute to Wjj^v (£) (similarly as on the 
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D. Odd coherent state 



We present the Wigner function of the odd coherent state with the mean photon number equal to two in Fig. ^a. 
The mean amplitude (a) of the odd coherent state is equal to zero and therefore the Wigner function W^* \ (£) of this 
state on the observation level 0\ is equal to the thermal Wigner function given by Eq.( 4.12| ) [see Fig. [I]g]. 



Observation level O2 



Using general expressions from Section IV B 2 we find that the Wigner function W^K (£) of the odd coherent state 



on the observation level O2 is the same as for the even coherent state [see Eq.( 5.20 )l but the parameters N and M in 
the present case read 



N = a 2 cotha 2 ; 



M 



a 



(5.26) 



We plot the Wigner function W$A£) in Fig. ^b. This is a "squeezed" -Gaussian function similar to the Wigner 
function of the even coherent state on the same observation level [see Fig. [j]b and discussion in the previous section] . 



2. Observation level Oa 



The photon number distribution of the odd coherent state is given by the relation (we assume a to be real): 

1 (a 2 ) 2n+1 



Pin — 0; P211+I — 



Consequently, the Wigner function (£) can be expressed as ( 4.48 ). Alternatively, if we use the fact that 

is equal to the phase averaged Wigner function of the odd coherent state W| tto )(£) given by Eq.(3.63), then we can 
write 



sinha 2 (2n + 1)! 



(5.27) 



e- a J (4za|^|)-e Q -J (4a|e|) 



(5.28) 



Sinn ar L J 

This function is always negative in the origin of the phase space. We plot the Wigner function (£) in Fig. ||c. 

In the present case Pq = P2 = and the Pi is the largest probability therefore the contribution of the Fock state |1) 
in Wf£\(£) is the most dominant which is clearly seen from Fig. |]c. We also note that, in general, any superposition 
of odd Fock states has a negative Wigner function on the observation level Oa- 



3. Observation level Oe 



For the odd coherent state all meanvalues P^n are equal to zero. Taking into account this information and the 
information about the mean photon number we reconstruct the Wigner function W^ a { (£) in the form (for details see 



Section IV C 2) 



4e- 2 l«l 2 



OO 



1 ^ 

k=0 



-2fc+l 



(4iei : 



(5.29) 



where n — a 2 cotha 2 . We plot this Wigner function in Fig. 



In the present case the dominant contribution of 



the Fock state |1) is seen (P = Pi — and due to the thermal- like photon number distribution on the odd-number 
subspace of the Fock state P3 is much smaller than Pi). We can co nclud e, that any superposition of odd Fock states 
on the observation level Ob has the Wigner function given by Eq.( f3.29| ), i.e., superpositions of odd Fock states are 
indistinguishable on Ob- 
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4- Observation level Oc 



Due to the fact that the odd coherent state is expressed as a superposition of only odd Fock states, i.e., ^ n P2n+\ = 
1, the Wigncr functions on the observation levels Oc and Oa are equal, i.e., W^Ut;) — W^\(£). 




FIG. 4. The reconstructed Wigner functions of the odd coherent state \a ) with n = 2. We consider the observation levels 
as indicated in the figure. 
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5. Observation level Odi 

The Wigner function wf^l\£) of the odd coherent state on the observation level Odi is given by the following 
relation [we remind us that for the odd coherent state we have Pq = 0] 

<?(0 = -^-r^|o>(0 + ^~rW th (0, (5.30) 
\ot ) n — 1 1 ' n — 1 

where n is the mean photon number in the odd coherent state; W r |o)(C) is the Wigner function of the vacuum state 



and Wth(£) is the thermal Wigner function for the state with n — 1 photons. We note that from Eq.(5.30) it follows 
that 

hm^ ) 1) (C) = ^| 1) (e), (5.31) 

We plot the Wigner function W^l (£) in Fig- ^e. Compared with Fig. [|d we see that the contribution of the Fock 
state |1) on the observation level Odi is smaller than on Ob- This is due to the fact that on the present observation 
level P2 is not equal to zero. 

6. Observation level Od2 

Reconstruction of the Wigner function (£) is straightforward. For the odd coherent state it is valid that if n 

is an odd integer, then 

1 a 2fl 

(5.32) 



and the Wigner function is given by Eq.(5.11). On the other hand if n is an even integer, then P n = and we again 

(D 



use Eq.( 5.11 ) for the reconstruction of the Wigner function W^ 2 We plot this Wigner function in Fig. ^f. Even 



though on this observation level P2 = the contribution from the vacuum state is significant and therefore W^?(£) 
is not negative in the present case. 

E. Fock state 

Mean values of the operators a k in the Fock state are equal to zero, therefore the Wigner functions W^< ;(£) and 

W \n)(0 0f the F ° ck State I 

n) on the observation levels 0\ and O2, respectively, are equal to the thermal Wigner 
function given by Eq.(4.12) [see Fig. ||b]. On the other hand the Shannon entropy of the Fock state is equal to zero, 



therefore this state can be completely reconstructed on the observation level Oa [see Fig. |5]a for the Wigner function 
of the Fock state |2)]. 



Observation level O 



B 



If the Fock state has an even number of photons then it can also be completely reconstructed on the observation 
level Ob- But if the number of photons of the Fock state is odd then the Wigner function of this Fock state on Ob is 



given by the relation (5.29) with n = n. 



2. Observation level Oc 

If the number of photons on the Fock state is odd than the corresponding Wigner function can be completely 
reconstructed on the observation level Oc ■ If the number of photons is even, then the Wigner function (£) is 



given by Eq.( 5.17 ) with n = n. We plot W&(£) in Fig. ^p. This Wigner function is the same as for the squeezed 



vacuum state W^P (£) and the even coherent state W^l (£) with the same mean photon number [see Figs. ||c and 
|3|d]. More generally, all superpositions of even Fock states with the same mean photon number are indistinguishable 
on O c . 
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3. Observation level Odi 



If the Fock state under consideration is the vacuum state then it can be completely reconstructed on the observation 
level Odi- If the number of photons is larger than zero, then Pq — and the corresponding Wigner function is given 
by Eq. (^3C| ) with n = n. We plot W^2)i0 in Fi g- 



4- Observation level Od2 

On this observation level the Wigner function of the Fock state |n) can be always completely reconstructed, because 
this observation level is defined in such way that P n = 1. 



(a) d =0a~0ii~0j. (b) tfj-Ca-CVt 




FIG. 5. The reconstructed Wigner functions of the Fock state |n = 2). We consider the observation levels as indicated in 
the figure. Even though the Fock state is represented by the non-Gaussian Wigner function it can be completely reconstructed 
on the observation level On. 



F. Observation level O n = {n,n 2 } 



We will finish this section with a brief discussion about the phase-insensitive observation level O n which is related 
to a measurement of the observables h and ft 2 . 

The generalized canonical density operator a on the observation level O n reads: 



1 1 °° 

<r n = — exp [-Ain - A 2 n 2 ] = -— exp [-Xin - X 2 n 2 ] \n)(n\. (5.33) 

The Lagrange multipliers are determined by the relations 
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dlnZ n 



m=0 



<)I,!Z " f> 2 P m , (5.35) 



SA 2 

m=0 



where 



P m = -rr cxp [-Aim - A 2 m 2 ] ; (5.36) 

and 

00 

Z n = cxp [—Aim — A2777 2 ] . (5.37) 

m— 

r 2 — /~2\ / - \ 2 ^ 



From Eqs.(5.34) and (5.35) it follows that if (ft) = N is an integer, then in the limit a n — > 0+ (where c 2 = (n 2 ) — (ft) 2 ) 



Ai = — 27VA 2 and A2 tends to infinity. Simultaneously 

P m — * $m,N> (5.38) 

which means that in this case a n — > |iV)(iV|. In other words, on the observation level O n the Fock state \N) can be 
completely reconstructed (see Fig.||a) and in this case the corresponding entropy S n — —fee J2 m ^ m m ^tn 1S equal to 
zero. 

The Wigner function of this state is negative, which in particular reflects the fact that the reconstructed distribution 
is narrower than the Poissonian (coherent-state) photon number distribution, i.e., the state under consideration 
exhibits sub-Poissonian photon number distribution. To quantify the degree of the sub-Poissonian photon statistics 
one can utilize the Mandel Q parameter defined as: 

(ft 2 ) - (ft) 2 - (ft) 

Q = W) , (5.39) 

which for Fock states is equal to -1 while for coherent states is equal to 0. The state is said to have sub-Poissonian 
photon statistics providing Q < 0. One can easily reconstruct sub-Poissonian states on the observation level O n . 
In addition states with the Poissonian photon statics Q = can be partially reconstructed on this observation level 
as well. For instance in Fig.Qh we represent a result of numerical reconstruction of the Wigner function W$(£) 
of the coherent state with a Poissonian photon number distribution on the observation level O n . In this case the 



reconstructed photon number distribution P n [see Eq.(5.36)] does not have a Poissonian character, and therefore the 



reconstructed Wigner functions of the coherent state on the observation levels Ok and O n are different (compare 
Figs.|l|b and [l]h, respectively) even though the reconstructed states have the same mean photon number (ft) and the 
same variance tr 2 in the photon number distribution. 



On the observation level O n we can reconstruct also the odd coherent state given by Eq.(3.61) which is a sub- 
Poissonian state with the Q parameter given by the relation (we assume a to be real) : 

4« 2 e~ 2a2 n 
Q= - \ 2 = —, " ^ <0, (5.40) 
l_ e -4a= (cosha 2 ) 2 v ' 

where the mean photon number n in the odd coherent state is given by the relation n = a 2 cotha 2 . We have plotted 
the result of the numerical reconstruction of the Wigner function of the odd coherent state with n = 2 on the given 
observation level in Fig.^g. Due to the fact, that for the given mean photon number the odd coherent state does not 
exhibit a significant degree of sub-Poissonian photon statistics, the corresponding Wigner function WjH (£) is not 
negative (compare with Fig.^c). 



The even coherent state (3.60) is characterized by the super-Poissonian photon statistics with the Mandel Q 



parameter given by the relation 



4a 2 e- 2 " 2 _ 

(sinh a 2 ) 2 



Q = = > °' ( 5 - 41 ) 
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with the mean photon number given by the relation n — a 2 tanha 2 . From Eq.( |5.41 ) it follows that for large enough 
values of a (i.e., for large enough values of n) the Mandel Q parameter is smaller than n (it tends to zero). In this case 
the Wigner function of the even coherent state on the observation level O n can be easily reconstructed (see Fig.|^). 
We can also reconstruct on this observation level a thermal mixture for whic h the Mandel Q parameter is equal to n 
(i.e., (n 2 ) = 2n 2 + n) In this case the Lagrange multiplier A2 in expression (5.32) is equal to zero and consequently 
the results of the reconstruction on the observation levels O n and 0th (thermal observation level) are equal. 

It is important to stress that all those states for which the Mandel Q parameter is less than n (in analogy with 
sub-Poissonian states we can call these states as the sub-thermal states) can be reconstructed on O n . For all these 



states the Lagrange multiplier A2 is greater than zero and consequently the generalized partition function (5.37) does 
exist. Nevertheless there are states for which Q > n (we will call these sta tes a s super-thermal states). For these 
state the Lagrange multiplier A2 is smaller than zero and Z n given by Eq.( [5.37 ) is diverging. Consequently, these 
states cannot be reconstructed on the observation level O n . In particular, the Mandel Q parameter for the squeezed 
vacuum state (3.54) reads Q — 2n + 1 (for h > 0) and therefore we are not able to reconstruct the Wigner function 
of the squeezed vacuum state on O n . Analogously, the even coherent state for small values of a such that sinha 2 < 1 
has a super-thermal photon number distribution and it cannot be reconstructed on this observation level. 

The mathematical reason behind the fact that super-thermal states cannot be reconstructed on O n is closely related 
to the semi-infmiteness of the Fock state space of the harmonic oscillato r, i.e ., the photon number distribution of these 
states cannot be approximated by discrete Gaussian distributions P m ( 5.36] ) on the interval m S [0,oo). In principle, 
there exist two ways how to regularize the problem: one can either expand the Fock space and to introduce "negative" 
Fock states, i.e., m G (—00, 00). Alternatively, one can assume finite-dimensional Fock space such that m € [0, s]. In 
both these cases Z n for super-thermal states is finite and in principle a n can be reconstructed (but it may depend on 
the regularization procedure). 



VI. OPTICAL HOMODYNE TOMOGRAPHY AND MAXENT PRINCIPLE 

From the point of view of th e formalism presented in this paper it follows that from the probability density 
distribution w p (xg) [see Eq.( |3.23| )l which corresponds to a measurement of all moments (xg ), the generalized canonical 
density operators a Xg [see also Eq.(2.11)]: 



1 



Ox 6 = ^— exp 

6x e 



dx$ \xg)(xg\X(xg) 

can be constructed. The Lagrange multipliers X(xg) are given by an infinite set of equations 



(6.1) 



Wp{x e ) = \ / 2irh(x g \a xe \xe}] Vx e <E (-00,00). 



(6.2) 



If probability distributions Wp(xg) for all values of 9 £ [0,7r] are known then the density operator on the complete 
observation level can be obtained in the form 



1 

9 = -77- exp 

^0 



de 



dxg \xe){xe\\(xg) 



(6.3) 



and the corresponding Wigner function can be reconstructed. The optical homodyne tomography can be understood 
as a method how to find a relation between measured distributions Wp{xg) and the Lagrange multipliers \(xg) for all 
values of xg and 9. As we have shown earlier in this section, the Gaussian and the generalized Gaussian states can 
be completely reconstructed on reduced observation levels based on a measurement of just finite number of moments 
of system observables, and therefore the optical homodyne tomography is essentially not needed as a method for 
reconstruction of Wigner functions in these cases. On the other hand, the non-Gaussian states can in principle 
reconstructed, but in practice the reconstruction of their Wigner functions is associated with a measurement of an 
infinite number of independent moments of system observables which is not realistic. In the experiments by Raymer 
ct al. |Io[ only a finite number of values of 9 have been considered, i.e., these types of experiments are associated with 
observation level for which the corresponding generalized canonical density operator reads 



1 

a = — exp 
Z v 



X h 



N x Ng 

1=1 m=l 



><*, 



CO 



(6.4) 
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A. Implementation and numerical examples 



We want to demonstrate our reconstruction scheme and compare it with known tomography scheme^] at four 
nontrivial examples. One is an incoherent superposition of two coherent states 

pi = ~(\ai)(ai\ + \a 2 ){a 2 \) , (6.5) 
the second is a superposition of two coherent states 

h = Afflai) + \a 2 )) ((ai| + (a 2 |) , (6.6) 

the third is a rectangular state 

h = \il)){i>\ (6.7) 



with 



and the last one is a Fock state 



i , . for a; € [— 2ai,2ail ,„ „. 

= elsewhere ^ 



h = \n){n\. (6.9) 
All calculations were carried out in Fock representation, where the projection operators 



read 



0/m = i4!i)<4ii ( 6 - 10 ) 

(dim) = ip*(xi)ip n2 (xi)exp(i6 m (rii-n2)), (6-11) 

V / Hi ,712 

and 9 m is the quadrature phase and xi is the eigenvalue of the operator. In the numerical examples we chose a\ = 1.25 
and «2 = 1-25 i for the first three states and n = 4 for the Fock state. 

Our numerical approach forces us to truncate the Hilbert space at a finite value n max and we must insure that an 
increase of this cut-off does not change our results significantly. On the other hand the number Ng of different angles 
9 and the number N x and separation Air of different x is given by the experiment. The error of any reconstruction 
scheme goes to zero when all x for all angles 9 are covered, i.e. when our knowledge about the state is complete. On 
the other hand for incomplete knowledge the different reconstruction schemes give different results and in this sense 
we want to compare the schemes. 

As a representation of the state and its reconstruction we show the Wigner function (see Fig. ||) . The plots in the 
upper line show the Wigner functions as surface plots, whereas the lower line shows the same functions as grey scale 
plots. The uniform grey background corresponds to the value zero whereas darker areas indicate positive values of the 



Wigner function. In (a) we show the state p\ itself as defined in (6.5) and in (b) the reconstruction p 1 as obtained via 
the maximum entropy principle. For the reconstruction we used only 4 different angles and 13 points on each axis. 
Despite this extremely small number the graphical representation of the state p l reveals no difference to the original 



3 The direct sampling method as described in Section [II can be straightfowardly applied also in the case when the quadrature 
components xg are measured at Ng discrete phases dm- As shown by Leonhardt and Munroe |37| if it is a priori known that 
pmn = for m — n\ > Ng then the density matrix elements p m „ for |m — n\ < Ng can be precisely reconstructed from the 
measured distributions w(xg,8 m ) at Ng phases 9 m . On the other hand, if the parameter x is discretize (which corresponds to a 
measurement of N x projectors \x g ^ }(xg ^ | in the direction m ), then the direct-sampling reconstruction can be applied as well, 
but may lead to "pathological" density operators which are not positively defined. Alternatively, the least-square inversion 
method (see for instance p&) can be efficiently applied. The advantage of this method is that it is a linear method which 
means that the density matrix can be reconstructed in a real time together with an estimation of the statistical error. We note 
that this method may also lead to non-positive density operators. 
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state pi. For completeness we include (c) the state as obtained via projection onto pattern functions as described in 
flBf . Contrary to (a) and (b) we also obtain white areas which correspond to a negative value of the Wigner function. 

Already from this plot it is obvious that the reconstruction via maximum entropy principle matches much better 
the original state. For a quantitative comparison we calculated 

A= J2 [(Pl)m,™ ~ ih)m,n 2 ] 2 (6.12) 
ni ,ri2 

as a measure for the error of the reconstruction. We vary the number Ng of different angles and N x of different values 
on each angle, their separation is chosen in such a way that they cover uniformly the integral [—2, 2], where — as can 
be seen in Fig. || — almost the whole state is located. The numerical cut-off for the Hilbert space was n max = 30. 

Whereas the errors A (Table |) for usual quantum tomography are of the order of one (thus on average of the 
order of 10 -3 per matrix element) the inclusion of the maximum entropy principle reduces the errors by several orders 
of magnitudes. We want to add that the large errors of usual quantum tomography of course decrease significantly 
when increasing the amount of measurement data, i.e. increasing Ng and N x . On the other hand, the reconstruction 
via projection onto pattern function is in general not positive definite which reflects some fundamental problems 
associated with this reconstruction scheme. 





Re(a) 




Re (a) 




Re (a) 



FIG. 6. Wigner function of an incoherent superposition of two coherent states (a), its reconstruction via the maximum 
entropy principle (b) and its reconstruction via projection onto pattern functions (c). The upper line shows the Wigner function 
as surface plots, whereas the lower line shows the same functions as a grey scale plot, where dark areas correspond to higher 
values and bright areas to lower values. We see that the reconstruction via the MaxEnt principle is much more reliable then a 
straightforward application of direct sampling via pattern functions. 



number of observables 


maximum entropy principle 


direct sampling 


Ng = 3 ; N x = 11 


1.0 ■ 10~ 3 


4.39 


Ng = 4 ; N x = 11 


2.8 -10- 5 


3.41 


N g = 4 ; N x = 13 


3.0 -10- 5 


1.18 


Ng = 5 ; N x = 15 


1.0 -to- 5 


0.76 



TABLE I. Deviation A of the reconstructed state from the incoherent superposition of two coherent states for reconstruction 
via the maximum entropy principle and for reconstruction via projection onto pattern functions. 
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One might suspect that the superiority of the reconstruction via the maximum entropy principle might be a 
speciality of the selected state, an incoherent superposition of two coherent st ates . Therefore we want to give some 
more examples, e.g. the corresponding coherent superposition, as defined in ( |6.6| ). As in the previous example the 
values of a\ and a-i were chosen to be 1.25 and 1.25 i, respectively (see Fig.0). 

Surprisingly enough the reconstruction turns out to be simplified by the quantum interferences apparent in the cat 
state: the deviations for both reconstruction schemes are smaller than for the incoherent superposition (Table ||). 
Apart from this the overall picture remains the same: The reconstruction with the maximum entropy principle is 
many orders of magnitudes better than the reconstruction via pattern functions. Moreover, the reconstruction via 
pattern functions may again result in density operators which are not posi tive definite. 

Next, we discuss the reconstruction of the rectangular state as defined in (6.7). Though this state is of less relevance 
in quantum optics it can be easily realized for atomic beams by an aperture. The reason, why we include this state 
into our discussion is twofold: on one hand, the oscillations in its Wigner function (see below) represent a serious 
difficulty for quantum tomography, so it is interesting to check, whether other reconstruction schemes do not have this 
difficulty. On the other hand the smoothening character of our reconstruction by selecting the state with maximum 
entropy may smooth out just these oscillations and therefore this state is a critical test of the maximum entropy 
reconstruction. 
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FIG. 7. Wigner function of a coherent superposition of two coherent states given by Eq.(6.6) (a), its reconstruction via 
the maximum entropy principle (b) and its reconstruction via the projection onto pattern functions (c). Again the upper line 
shows the state as surface plot and the lower line the corresponding grey scale plots. 



number of observables 


maximum entropy principle 


direct sampling 


N e = 3 ; N x = 11 


1.74- 10" a 


6.90 


N e = 4 ; N x = 11 


7.8 -10- 6 


4.33 


N e = 4 ; = 13 


1.3 ■ 10" 21 


1.00 


N e = 5 ; N x = 15 


1.5 ■ KT 11 


0.377 



TABLE II. Deviation A of the reconstructed state from the coherent superposition of two coherent states for reconstruction 
via the maximum entropy principle and for reconstruction via projection onto pattern functions. 
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The overall picture (Fig. ||) is the same as for the previous two examples: the reconstruction via the maximum 
entropy principle gives a deviation (see also Table [II) from the original state many orders of magnitude lower than 
conventional quantum tomography does. Once more we want to stress that the bad reconstruction by usual quantum 
tomography is due to the extremely small number of angles and grid points. Increasing the amount of measurement 
data makes this reconstruction scheme working. 

Finally, we turn to a state, which is relatively easy to reconstruct via quantum tomography: a number state (6.E) 
with n = 4. This state can be obtained even exactly with a finite number of phases N$ provided that Ng = n + 1 and 
that each quadrature is measured completely, i.e. covering densely the whole axis. Since our examples do not and 
cannot fulfill the latter condition, we encounter again a situation where the reconstruction via quantum tomography 
suffers from too few measurement data. To allow for a fair comparison we restrict the Hilbert space to N = n = 4, 
otherwise quantum tomography adds additional errors in the higher density matrix elements. 





-4-2024 
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FIG. 8. Wigner function of a rectangular state given by [see Eq. (|6.7|) 1 (a), its reconstruction via the maximum entropy 
principle (b) and its reconstruction via the projection onto pattern functions (c). 



number of observables 


maximum entropy principle 


direct sampling 


N g = 3 ; N x = 11 


4.11 ■ ur 2 


11.2 


N e = 4 ; N a = 11 


2.6 ■ KT 15 


7.88 


N e = 4 ; N x = 13 


3.8 ■ lir" 


2.89 


N e = 5; N„ = 15 


1.9 ■ 10~ 12 


2.17 



TABLE III. Deviation A of the reconstructed state from the rectangular state for reconstruction via the maximum entropy 
principle and for reconstruction via projection onto pattern functions. 
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All errors (Table [TV]) are smaller than their counterparts for the other states considered so far, which is also due 
to the smaller Hilbert space under consideration. For this state the conventional reconstruction also gives a very 
good estimate of the state (see Fig. ||) , though the absolute values of the oscillations in the Wigner function (Fig. ||) 
are not completely correct due to the finite number of measurements on each axis. But even for this state, which is 
advantageous for quantum tomography, the errors of the reconstruction via the maximum entropy principle are much 
smaller. 




Re (a) Re(n.) tie (a) 

FIG. 9. Wigner function of a Fock state (a), its reconstruction via the maximum entropy principle (b) and its reconstruction 
via the projection onto pattern functions (c). 



number of observables 


maximum entropy principle 


direct sampling 


N e = 3 ; N x = 11 


6.2 ■ 1CT 21 


0.135 


N e = 4 ; N a = 11 


1.3 ■ KT 20 


0.135 


N e = 4 ; N x = 13 


3.7- 10" 31 


0.45 


N e = 5 ; N x = 15 


1.6 ■ KT 31 


0.125 



TABLE IV. Deviation A of the reconstructed state from the Fock state for reconstruction via the maximum entropy principle 
and for reconstruction via projection onto pattern functions. 
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FIG. 10. Wigner function as grey-scale plots of the reconstruction of the incoherent superposition shown in Fig. 6. Here 
we assume measurement results with errors. The errors are proportional to a factor r], which is 10~ 2 (a), 5 ■ 10~ 2 (b), 10" 1 (c) 
and 5 ■ 10" 1 (d). 



B. Measurement errors and incompatible measurement results 



We want to discuss briefly the influence of measurement errors. Including measurement errors we encounter a 
new problem: we can not guarantee that there is any state with positi ve d efinite density matrix compatible with all 



measurement results ( |2.8| ). In other words, the set C as defined by Eq. (2^8) is empty. Practically this means that our 
numerical procedure to solve numerically the equations for the Lagrange parameters cannot converge. Fortunately it 
turns out that the set of Lagrange parameters minimizing the deviation 

(n - Tr {~pn}f + £ (o' lm - Tr [p6 lm ]) 2 (6.13) 

lm 

gives generally an excellent estimate for the state to be reconstructed. To illustrate this we take our state and spoil 
artificially our measurement results by 



J lm 



(6.14) 



o' lm is the result of the measurement with errors, whereas o; m = Tr(pO; m ) is the corresponding result one would obtain 
in an ideal measurement. The error was chosen to be proportional to the square root of o; m since this quantity is 
obtained by measuring xg several times and counting how much results fall into a certain interval. The proportionality 
factor 7/ characterizes the quality of our measurement and depends on the number of single measurements made. £/ m 
represent independent Gaussian random numbers with 

(6m) = 

(Cim&'m') = h'$mm'- (6.15) 
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FIG. 11. Wigner function as grey-scale plots of the reconstruction of the cat state shown in Fig. 7 on the basis of measurement 
results with errors. The errors are proportional to a factor rj, which is 10 -2 (a), 5 ■ 10~ 2 (b), 10 -1 (c) and 5 ■ 10 _1 (d). 

W ith these oi m a nd various parameters rj we started again the reconstruction of the incoherent superposition 
(3.5), the cat state (6J3), the rectangular state and the Fock state already discussed in the last paragraph. For all 
reconstructions we used Ng = 4 different axis' and N x = 13 points at each axis, covering the interval [—2, 2] as before. 

Fig. [l(^ shows the Wigner functions of the reconstruction of the incoherent superposition for 77 — 10~ 2 , 5 ■ 10~ 2 , 
10 _1 and 5 ■ 10 _1 . Despite the relative large value s of 7 7 — corresponding to a large error — the reconstruction is very 
good. We also calculated the error as defined in fl6.12| ) and obtained A = 4 • 1CT 3 for 77 = 10" 2 , A = 4 • 1CT 2 for 
77 = 5 • 10~ 2 , A = 5 • 1CP 2 for 77 = 0.1 and A = 0.3 for 7/ = 0.5. For 77 = 0.05 we recognize a slight asymmetry 
between the two spots, which becomes more pronounced for the highest value of 77. But still the two dots are easily 
distinguishable. Due to the random character of our calculation these numbers will vary when varying the random 
numbers — as the results of a measurements will vary from run to run. 

Next, we present the corresponding plots for the cat state (Fig. |ll|). As in the previous figure the values for 77 are 
10" 2 , 5 ■ 10~ 2 , 10- 1 and 5 • 10" 1 for (a), (b), (c) and (d), respectively. The obtained errors A are 3 • 10" 3 , 3 • 10~ 2 , 
7 • 10~ 2 and 2 • 10 _1 . We want to emphasize that for both states even for the largest value of 77 the reconstruction is 
better than the usual quantum tomography without errors. 

Now we discuss the influence of measurement errors for the rectangular state. Fig. [l^ shows the Wigner functions 
of the reconstruction with an error parameter 77 = 10~ 2 (a), 5 • 10~ 2 (b), 10" 1 (c) and 5 • 10 _1 (d). As for the previous 
states only for the highest error parameter 77 we recognize an asymmetry not present in the original state (Fig. || (a)). 
The resulting errors were 2 • 10~ 3 , 2 • 10~ 2 , 6 • 10 -2 and 3 • 10 _1 , respectively. Despite the relatively large errors the 
reconstruction is as in the previous examples very good. Only for the largest value of 77 we recognize a qualitative 
difference to the original state Fig. || (a). 
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-2 2 

Re (a) 

FIG. 12. Wigner function as grey-scale plots of the reconstruction of the rectangular state shown in Fig. 8 on the basis of 
measurement results with errors. The errors are proportional to a factor rj, which is 1CP 2 (a), 5 • 1CP 2 (b), 1CT 1 (c) and 5 • 1CP 1 
(d). 
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FIG. 13. Wigner function as grey-scale plots of the reconstruction of the Fock state shown in Fig. 9 on the basis of 
measurement results with errors. The errors are proportional to a factor rj, which is 1CP 2 (a), 5 • 1CP 2 (b), 1CT 1 (c) and 5 • 1CP 1 
(d). 
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Finally, we turn to the influence of measurement errors on the reconstruction of the Fock state. The errors A for 
the various error parameters r\ are A = 8 • 10~ 4 for rj = 1 ■ 10~ 2 , A = 1 ■ 10~ 2 for 77 = 5 • 10 -2 , A = 4 ■ 10~ 2 for 
r] = 0.1, and A = 0.7 for 77 = 0.5. Also the plots of the corresponding Wigner functions (Fig. [l3|) reveal that the 
reconstruction is very good and shows the ring-shaped structure of the original Fig. ^| (a). Only for the 77 = 0.5 the 
reconstruction is not good enough to show clearly this feature. 

Usually this kind of measurement results into a very good reconstruction of Wigner functions (such that the 
corresponding entropy is close to zero for pure states). Nevertheless, a certain attention has to be paid for highly 
squeezed states, such as the Vogel-Schleich phase states |69|, for which the measurement of distributions Wp(xe ) can 
be problematic. Namely, Wp(xg j ) can be very "wide", so that the normalization condition is not fulfilled in a domain 
of physically accessible values of xg j . 

In this Section we have presented a numerical application of the reconstruction scheme via the MaxEnt principle 
for a reconstruction of Wigner functions of quantum-mechanical states of light from incomplete tomographic data. 
We have shown that when the tomographic data are incomplete, then the reconstruction via the MaxEnt principle is 
much more reliable than the standard inversion Radon transformation scheme or the pattern-function scheme. 



VII. RECONSTRUCTION OF SPIN STATES VIA MAXENT PRINCIPLE 



In the following sections we will apply the Jaynes principle for the reconstruction of pure spin states (see also 
Jto|). Firstly, for illustrative purposes we present the simple example of the reconstruction of states of a single 
spin- 1/2 system with the help of the maximum-entropy principle. Then we will discuss the partial reconstruction of 
entangled spin states. In particular, we will analyze the problem how to identify incomplete observation levels on 
which the complete reconstruction can be performed for the Bell and the Greenberger-Horne-Zeilinger states (i.e., 
the corresponding entropy is equal to zero and the generalized canonical density operator is identical to po). 



A. Single spin-1/2 



Firstly we illustrate the application of the maximum-entropy principle for the partial quantum-state reconstruction 
of single spin-1/2 system. Let us consider an ensemble of spins-1/2 in an unknown pure state |"0o) - In the most 
general case this unknown state vector \ipo) can be parameterized as 

\ip ) = cos6>/2|l) + sin6»/2|0), (7.1) 

where |0), |1) are eigenstates of the ^-component of the spin operator s z = ^a z with eigenvalues —1, |, respectively. 
The corresponding density operator p Q = \ipo)(tpo\ can be written in the form 

PQ = ~(i+ft.ii), (7.2) 



where I is the unity operator, n = (sin 9 cosy, sin 9 sin if, cos 9); a — (a x ,a v ,a z ) are the Pauli spin operators which in 
the matrix representation in the basis |0), |1) read 



1\ „ _ / -i \ . _ f 1 

1 J ' Uy ~ [ i o r Gz ~ \ -1 



(7.3) 
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reconstructed density operator 




• 






Ph = \(l + n z a z ^j 




• 


• 




Pn = \{l + n z a z + n x a x ^j 


0(1) 
^comp 


• 


• 


• 


Pcomp = \ (i + n z a z + n x a x + n y a y ^j 



TABLE V. In this table we present three observation levels ; Og \ and oioL P associated with a measurement of the 
particular spin-1/2 operators. Bullets (•) in the table indicate which observables constitute a given observation level. We also 
present explicit expressions for the reconstructed density operators /3a; /5b and /3 CO mp. 
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To determine completely the unknown state one has to measure three linearly independent (e.g., orthogonal) 
projections of the spin. After the measur emen t of the expectation value of each observable, a reconstruction of 
the generalized canonical density operator (2.11) according to the maximum-entropy principle can be performed. In 

Table |v| we consider three observation levels defined as — {&z}, = {cr z ,a x } and Oq — {a- z ,a x , a y } = O comp 
[the superscript of the observation levels indicates the number of spins-1/2 under considerati on] . 
Using algebraic properties of the av-operators, the generalized canonical density operator (2.11) can be expressed 

as 



1 ( ?^ 1 
Po = — cxp(-A.o-) = - 



cosh | \\I — sinh |A 



X.a 
|A| 



Z = 2cosh|A|, 



(7.4) 



with A = (X x , Xy,X x ) and |A| 2 = X x + X y + X 2 Z . The final form of the po on particular observation levels is given in 
Table The corresponding entropies can be written as 



So = -PO Inpo - (1 - Po) ln(l - po), 
where po is one eigenvalue of po [the other eigenvalue is equal to (1 — Po)\ which reads as 

1 



1 



Pa 



Pb 



l + ^(a x )* + (a z y 



P 



comp 



(7.5) 



(7.6) 



It is seen that the entropy So is equal to zero if and only if po = 1. From here follows that on O^ 1 only the basis 
vect ors |0) and |1) with \{cr z )\ = 1 can be fully reconstructed. Nontrivial is Og\ on which a whole set of pure states 



(7.1) with (a y ) = (i.e., ip — 0) can be uniquely determined. For such states Sb = and further measurement of the 



Ijy on O comp represents redundant (useless) information. 



B. Two spins-1/2 

Now we assume a system composed of two distinguishable spins-1/2. If we are performing only local measurements 
of observables such as ajx ® 1^ and 1^ ® oif 1 (here superscripts label the particles) which do not reflect corre- 
lations between the particles then the reconstruction of the density operator reduces to an estimation of individual 
(uncorrelated) spins-1/2, i.e., the reconstruction reduces to the problem discussed in the previous section. For each 
spin-1/2 the reconstruction can be performed separately and the resulting generalized canonical density operator is 
given as a tensor product of particular generalized canonical density operators, i.e., p — p^ ® p^ 2 \ In this case just 
the uncorrelated states |V>o) = IV'o 1 '') ® IV'o 2 '') can be fully reconstructed. Nevertheless, the correlated (nonfactorable) 
states |V>o) 7^ | Vq 1 "* ) ® IV'q 2 ') are °f central interest. 
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TABLE VI. We present a set of observation levels on which the density operators of two spins-1/2 can be partially recon- 
structed. Bullets (•) in the table indicate which observables constitute a given observation level while empty circles (o) denote 
unmeasured observables (i.e., these observables are not included in the given observation level) for which the maximum-entropy 
principle 'predicts" nonzero mean values. 
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In general, any density operator of a system composed of two distinguishable spins-1/2 can be represented by a 
4x4 Hermitian matrix and 15 independent numbers are required for its complete determination. It is worth noticing 
that 15 operators (observables) 



K (1) ®/( 2 '/ (1) 



(7.7) 



together with the identity operator 1^ Cg> i^ 2 ' form an operator algebra basis in which any operator can be expressed. 
In this "operator" basis each density operator can be written as 



f(i) 



f(2) 



.(1) 



f(2) 



,(2) 



(7.; 



with £ 



-(2) 



Using the maximum-entropy principle we can (partially) reconstruct an unknown density operator po on various 
observation levels. Conceptually the method of maximum entropy is rather straightforward: one has to express 
the generalized canonical density operator (2.11) for two spins-1/2 in the form (7.8) from which a set of nonlinear 



equations for Lagrange multipliers \ u is obtained. 

Due to algebraic properties of the operators under the consideration the practical realization of this programme 
can be technically difficult (see Appendix A). In Table VI we define some nontrivial observation levels. Measured 
observables which define a particular observation level are indicated in Table VI by bullets (•) while the empty circles 
(o) indicate unmeasured observables (i.e., these observables are not included in the given observation level) for which 
the maximum-entropy principle "predicts" nonzero mean values. This means that the maximum-entropy principle 
provide us with a nontrivial estimation of mean values of unmeasured observables. The generalized c anon ical density 
operators which correspond to the observation levels considered in Table VI are presented in Table VII. The signs 



"©,0" are used to indicate unmeasured observables for which nontrivial information can be obtained with the help 
of the maximum-entropy principle. 
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TABLE VII. We present explicit expressions for the reconstructed density operators px of two spins-1/2 on the observation 
levels denoted in Table 2. We use the notation — (<r|f ) (fi — z,x,y; p = 1, 2) and = (o^ ® o"^ 2 ') with /i, v — z, x, y. 
The signs © and are used to indicate unmeasured observables for which nontrivial information can be obtained with the help 
of the maximum-entropy principle. 
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C. Reconstruction of Bell states 



In what follows we analyze a partial reconstruction of the Bell states (i.e., the most correlated two particle states) 



on observation levels given in Table VI. One of our main tasks will be to hnd the minimum observation level (i.e., 
the set of system observables) on which the complete reconstruction of these states can be performed. Obviously, if 
all 15 observables are measured, then any state of two spins-1/2 can be reconstructed precisely. Nevertheless, due to 
the quantum entanglement between the two particles, measurements of some observables will simply be redundant. 
To find the minimal set of observables which uniquely determine the Bell state one has to perform either a sequence 

(2) 

of reductions of the complete observation level, or a systematic extension of the most trivial observation level O a . 
Let us consider particular examples of Bell states, of the form 

|*(f e "))--i=[|l,l>+e^|0,0)], ^ell) = ^(BeU) ) ^(BeU) l (?9) 

(other Bell states are discussed later). These maximally correlated states have the property that the result of a 
measurement performed on one of the two spins-1/2 uniquely determines the state of the second spin. Therefore, 
these states find their applications in quantum communication systems p3j , |52"| . In addition, they are suitable for 
testing fundamental principles of quantum mechanics Q such as the complementarity principle or local hidden- 
variable theories 

— (2) 

Let us analyze now a sequence of successive extensions of the observation level O A 

of c of c of c of. (7.10) 



(2) I* — t! 

The observation level O a (see Table VI) is associated with the measurement of a z observables of each spin individually, 



(2) 

i.e., it is insensitive with respect to correlations between the spins. On O b both z-spin components of particular 
spins and their correlation have been recorded (simultaneous measurement of these observables is possible because 

(2) 

they commute) . Further extension to the observation level on O c corresponds to a rotation of the Stern-Gerlach 

(2) 

apparatus such that the £-spin component of the second spin-1/2 is measured. The observation level O d is associated 
with another rotation of the Stern-Gerlach apparatus which would allow us to measure the y-spin component. The 

(2) (2) (2) 

generalized canonical density ope rators on t he ob servation levels O b , O c and O d predict zero mean values for all 



the unmeasured observables (|7.7[) (see Table VII) 



In general, successive extensions ( |7.10| ) of the observation level Of should be accompanied by a decrease in the 
entropy of the reconstructed state which should reflect increase of our knowledge about the quantum-mechanical 
system under consideration. Nevertheless, we note that there are states for which the entropy remains constant when 

(2) (2) (2) 

O b is extended towards O c and O d , i.e., the performed measurements are in fact redundant. For instance, this is 
the case for the maximally correlated state d7.9|). Here entropies associated with given observation levels read 



S A = 2 In 2, S B =S C = S D = In 2, (7.11) 

respectively, which mean that these observation levels are not suitable for reconstruction of the Bell states. The 
reason is that the Bell states have no "preferable" direction for each individual spin, i.e., (oj, ) = for fj, = x,y,z 
and p — 1,2. 

From the above it follows that, for a nontrivial reconstruction of Bell states, the observables which reflect correlations 
between composite spins also have to be included into the observation level. Therefore let us now discuss the sequence 
of observation levels 

of C of C Of (7.12) 



associated with simultaneous measurement of spin components of the two particles [see Table VI ] . The corresponding 



generalized canonical density operators are given in Table VII. To answer the question of which states can be 



(2) 

completely reconstructed on the observation level O e we evaluate the von Neumann entropy of the generalized 
canonical density operator pe- For the Bell states we find that Se — ~Pe ^Pe — (1 — Pe) hi(l — pe) where pe — 
(1 — cosiy9)/2. We can also compare directly p^ BeH ' with pe- The density operator p[^ el1 ^ in the matrix form can be 
written as 



-(Bell) = I 

^ 2 



10 e - ^ 





1 



(7.13) 
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(2) 

while the corresponding operator reconstructed on the observation level O E reads 



( 1 cos<p 





V cos 1 



(7.14) 



We see that p^ el1 ' 



or 7r which means that the Bell states |4' v =o n 
^ - a (2) and a x 1] - - (2) 



and 



Pe and S[pe] = only if ip 
-j= [|1,1) ± 1 0, 0)] are completely determined by mean values of two observables a 

(2) 

that these states can be completely reconstructed on O e . We note that two other maximally correlated states 
|$±) = ^ [|0, 1) ± 1 1, 0)] can also be completely reconstructed on O^. 

The extension of O^ to OjP does not increase the amount of information about the Bell states ( |7.9| ) with (f ^ 0, tt. 
For this reason we have to consider further extension of Op 2 ' to the observation level Oq ^ (see Table [Vl| and Appendix 
A). In what follows we will show that this is an observation level on which all Bell states (^^) can be completely 
reco nstru cted. To see this one has to realize two facts. Firstly, the generalized canonical density operator pq given by 
Eq.( [2.1l' ) can be expressed as a linear superposition of observables associated with the given observation level, i.e.: 



PG 



^ is 4 2) 



= ^cxpl- A w aW <8 aff - Xsy&W <8 u 

G \ n=x,y,z 



(7.15) 



where the parameters are functions of the Lagrange multipliers X^ u . Secondly for Bell states (7J) the only 



observables which have nonzero expectation values are those associated with Oq . Namely, {a\ L> (8 o 



i-C 1 ) 



,(2) 



= 1, 



i-(l) 



ay 1 ^ (8 Uy 2 " 1 ) — cos ip and (ax 1 ' <8 <5\ 



,(2) 



-(2)\ 



sin ip. It means that all coefficients in the generalized 



canonical density operator pc given by Eq. ( |7.8| ) are uniquely determined by the measurement, i.e., pa = P v - 



(2) 

From the above it follows that Bell states can be completely reconstructed on the observation level O g . On the 

(2) 

other hand, O g is not the minimum observation level on which these states can be completely reconstructed. The 
minimum set of observables which would allow us to reconstruct Bell states uniquely can be found by a reduction of 

(2) 

Oq . Direct inspection of a finite number of possible reductions reveals that Bell states can be completely reconstructed 



on those observation level which can be obtained from { q ] when one of the observables &i> ' ® &{, ' {v = x, y, z) is 
omitted. As an example, let us consider the observation level O^) given in Table |vi] which represents a reduction of 



Oq when the observable at' <8> o)?' is omitted. Performing the Taylor series expansion of the generalized canonical 
density operator pn defined by Eq. (A. 33) one can find that the only new observable &i (8 enters the expression 



,(2) 



for the ph as indicated in Table |VII|. The coefficient t in front of ai^ (8 at' can either be found explicitly in a closed 



J2) 



analytical form (see Appendi x A) o r can be obtained from the following variational problem. Namely, we remind 
ourselves that the expression (A. 32) for pn helps us to identify those unmeasured observables for which the Jaynes 
principle of the maximum entropy "predicts" nonzero mean values. At t his stage we still have to find the particular 
value of the parameter t for which the density operator pn in Table VII leads to the maximum of the von Neumann 
entropy. To do so we search through the one-dimensional parametric space which is bounded as — 1 < t < 1. To 
be specific, first of all, for t G (—1, 1) we have to exclude those operators which are not true density operators (i.e., 
any such operators which have negative eigenvalues). Then we "pick" up from a physical parametric subspace the 
generalized canonical density operator with the maximum von Neumann entropy. Direct calculation for Bell states 
shows that the physical parametric subspace is reduced to an isolated "point" with t = (ai 1 ' <8eri 2 ^) = 1. Therefore we 
conclude that Bell states can completely be reconstructed on Oh ■ Two other minimum observation levels suitable for 



the complete reconstruction of Bell states can be obtained by a reduction of O g when either <8 erf or (g> a 



•(2) 



.(1), 



r (2) 



is omitted. On the other hand, direct inspection shows that a reduction of O g by exclusion of either a x ~ «y u y 

(8 ai 2 ^ leads to an incomplete observation level with respect to Bell states. 

(2) (2) in 

In what follows we discuss briefly two other observation levels 0\ and Oj which arc defined in Table |VI|. The 



-(i) 



-(2) 

ay or 



(2) 

observation level O \ serves as an example when one can find an an alyt ical expression for the Taylor series expansion 



of the canonical density operator pi (see Table VII) in the form (|7.8|). The coefficients (functions of the original 
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Lagrange multipliers) in front of particular observables in Eq.(7.8) can be identified and are given in Table VII 



(2") ! 

Problems do appear when O \ is extended towa rds Of . In this case we cannot simplify t he e xponential expression 



1(2 



for pj and rewrite it analytically in the form (^8) as a linear combination of the observables (7/?). In this situation one 
should apply the following procedure: firstly, by performing the Taylo r-ser ies expansion of the pj to the lowest orders 
one can identify the observables with nonzero coefficients in the form (7.8). Namely, for pj the additional observables 

(2) 



' , Ox ' and by ® a^' appear in addition to those which form Of [see Tabic |Vl| . The corresponding 

coefficients u,v,w € (—1,1) form a bounded three-dimensional parametric space (u,v,w). In the second step one 
can use constructively the maximum-entropy principle to choose within this parametric space the density operator 
with the maximum von Neumann entropy. The basic procedure is to scan the whole three-dimensional parametric 
space. At the beginning, one has to select out those density operators (i.e., those parameters u,v,w) which posses 
negative eigenvalues and do not represent genuine density operators. Finally, from a remaining set of "physical" 
density operators which are semi-positively defined the canonical density operator pj with maximum von Neumann 

(2) (2) 

entropy has to be chosen. For a completeness, let us notice that for Bell states the observation levels O) and O ) 

are equivalent to O e , i.e., pi = pj = pe- 

In this section we have found the minimum observation levels [e.g., O h } which are suitable for the complete 
reconstruction of Bell states. These observation levels are associated with the measurement of two-spin correlations 



(2) 



® <rf , fry 1 ' ® and two of the observables of ® of {y = x,y,z). Once this problem has been solved, it is 
interesting then to find a minimum set of observables suitable for a complete reconstruction of maximally correlated 
spin states systems consisting of more than two spins-1/2. In the following section we will investigate the (partial) 
reconstruction of Greenberger-Horne-Zeilinger states of three spins-1/2 on various observation levels. 



,(2) -(i) 



,(2) 



(2) 



D. Three spins-1/2 

Even though the Jaynes principle of maximum entropy provides us with general instructions on how to reconstruct 
density operators of quantum-mechanical systems practical applications of this reconstruction scheme may face serious 
difficulties. In many cases the reconstruction scheme fails due to insurmountable technical problems (e.g. the system 
of equations for Lagrange multipliers cannot be solved explicitly) . We have illustrated these problems in the previous 
section when we have discussed the reconstruction of a density operator of two spins-1/2. Obviously, the general 
problem of reconstruction of density operators describing a system composed of three spins-1/2 is much more difficult. 
Nevertheless a (partial) reconstruction of some states of this system can be performed. In particular, in this section 
we will discuss a reconstruction of the maximally correlated three spin-1/2 states - the so-called Greenberger-Horne- 
Zeilinger (GHZ) state 0: 



\*f HZ) ) 



1 
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i,i,i) 



3 1 * |0,0,0> 



Hip 



(7.16) 



Our main task will be to identify, with the help of the Jaynes principle of maximum entropy, the minimum observation 
level on which the GHZ state can be completely reconstructed. 

We start with a relatively simple observation level O b such that only £wo-particle correlations of the neighboring 
spins are measured, i.e. 



O 



(3) 



(1) 



f(2) 



U3) 



}• 



(7.17) 



The generalized density operator associated with this observation level reads 



PB 



I« ® 1^ ® 1^ + (a« ® ai 2 > (g) I^)&P ® <rf ® /( 3 > 



{I 



(i) 



r(2) 



)/(!) 



r(3) 



8 (a« ® ® a ® a^)a^ 



(7.18) 



where '©" indicates a prediction for the unmeasured observable. In particular, for the GHZ states ( 7.1 6| ) we obtain 
the following generalized canonical density operator 



J GHZ) 

Pb 



1 r 

8 



/« ® 1^ g JO) 
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a 



V ® a< 2) ® I (3) + ® a< 2) ® ^ © <t« © h ® ^ 



= i|l,l,l)(l,l,l| + i|0,0,0)(0,0,0|. (7.19) 

The reconstructed density operator p^ HZ ^ describes a mixture of three-particle states and it does not contain any 

information about the three-particle correlations associated with the GHZ states. In other words, on the phase 
information which plays essential role for a description of quantum entanglement cannot be reconstructed. This is 

due to the fact that the density operator p"g HZ ^ is equal to the phase-averaged GHZ density operator, i.e. 

= ± f/f"*> *p. (7.20) 

Because of this loss of information, the von Neumann entropy of the state pfg HZ ^ is equal to In 2. We note, that when 
the GHZ states are reconstructed on the observation levels Og) = {o'jP <8> (jj? 1 ® /C 3 ), jW eg) ® a^} (p = x,y), 



then the corresponding reconstructed operators are again given by Eq.(7.19). These examples illustrate the fact that 
three-particle correlation cannot be in general reconstructed via the measurement of two-particle correlations. 

To find the observation level on which the complete reconstruction of the GHZ states can be performed we recall 
the observables which may have nonzero mean values for these states. Using abbreviations 



w = ® *P ® ^ (3) ), u«s = ® 4 2) ® *£ 3) >, tw, = ® i {2) 



i a 



(3)\ 



C/*i^w 3 = W ® ® ^ )) iv,v,u = x,y,z), (7.21) 
we find the nonzero mean values to be 

£ziZ2 = Cz2Z3 = £«lZ3 = lj 

Cxi£C2l/3 = Cyi^xs = Cxi2/2X3 = sm V) 

CyiV2X3 = CxiysVa = (,yix-2V3 = — COSy, 

CX1S2X3 = COS 

Cyiitoi/a = " sin V- (7-22) 

We see that for arbitrary </? there exist non-vanishing three-particle correlations Cjtivaws- The observation level which 
consists of all the observables with nonzero mean values is the complete observation level with respect to the GHZ 
states. Our task now is to reduce this set of observables to a minimum observation level on which the GHZ states 
can still be uniquely determined. In practice it means that each observation level which is suitable for the detection 
of the existing coherence and correlations should incorporate some of the observables with nonzero mean values. The 
other observables of these observation levels should result as a consequence of m utual tensor products which appear 



in the Taylor series expansion of the generalized canonical density operator (2.11). It can be seen by direct inspection 
of the finite number of possible reductions that the minimum set of the observables which matches these requirements 
consists of two two-spin observables and two three-spin observables. For the illustration we consider the observation 
level 

Of = {a« ®<^ 2) ®/ ( V (1 Wf ®a<^ (7.23) 

In this case the exponent C of the generalized canonical density operator pc = exp(—C)/Zc can be rewritten as 
C = Ci + C 2 with 61 = 7i 2 o-i X) ® <ri 2) ® I {3) + 723^ (1) ® ^i 2) ® ^i 3) and C* 2 = aai 1} ® &i 2) ® &i 3) + (3a y 1] ® o£ 2) ® a y 3) . 
The operators Ci, 62 commute and further calculations are straightforward. After some algebra the generalized 
density operator pc can be found in the form 

pc = I [/w © i (2) © i (3) + ® *i 2) ® ^ (3) + ® 4 2) ® si 3) (7.24) 

L 

+CXXX2X34 15 ® ^ ® ^ + Cnaw*^ ® 4^ ® © ki****!^ ® ^ (2) ® ^ 3) 

eCx!^ (6 12 2^ 1} ® <^ 2) ® 4 3) + U«s4 x) ® <^ 2) ® 4 3) + ® <^i 2) ® 4 3) ) 

© Cvivaw (kna^ 5 ® ^i 2) ® <H 3) + ® 4^ ® 4 3) + ® ^ 2) ® 4 3) ) 
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For the GHZ states the von Neumann entropy of the generalized canonical density operator pc is equal to zero, from 
which it follows that pc — p^ HZ ^ [see Eq. ( 7.16 )1, i.e., the GHZ states can be completely reconstructed on Oq. 
Moreover, the observation level Oc represents the minimum set of observables for complete determination of the 
GHZ states. 



VIII. QUANTUM BAYESIAN INFERENCE 

The exact meanvalue of an arbitrary observable can only be obtained when a very large (in principle, infinite) 
number of measurements on individual elements of an ensemble is performed. On the other hand, it is a very 
legitimate question to ask "What is the best a posteriori estimation of a quantum state when a measurement is 
performed on a finite (arbitrarily small) number of elements of the ensemble?" . To estimate the state of the system 
based on an incomplete set of data, one has to utilize more powerful estimation schemes such as the quantum Bayesian 
inference. 

The general idea of the Bayesian reconstruction scheme (see for instance |^|) is based on manipulations with 
probability distributions in parametric state spaces Q and A of the measured system and the measuring apparatus, 



respectively. The quantum Bayesian method as discussed in the literature [ pO|j3lU42f1 is based on the assumption that 
the reconstructed system is in a pure state described by a state vector \^f), or equivalently by a pure-state density 
operator p — \^f)(^f\. The manifold of all pure states is a continuum which we denote as f2. The state space A of 
reading states of a measuring apparatus associated with the observable O is assumed to be discrete. These states are 
intrinsically related to the projectors P x q, where A^ are the eigenvalues of the observable O. 

The Bayesian reconstruction scheme is formulated as a three-step inversion procedure: 
(1) As a result of a measurement a conditional probability 



p(0,\ i \p) = Tr{P Xu6 p), (8.1) 

on the discrete space A is obtained. This conditional probability distribution specifies the probability of finding the 
result Xi if the measured system is in a particular state p. 

(2) To perform the second step of the inversion procedure we have to specify an a priori distribution po(p) defined on 
the space Q. This distribution describes our initial knowledge concerning the measured system. Using the conditional 
probability distribution p{0 1 Xi\p) and the a priori distribution po(p) we can define the joint probability distribution 
p(0, \;p) 

p(6,X i ;p)=p(d,X i \p)p (p), (8.2) 

on the space O £g> A. We note that if no initial information about the measured system is known, then the prior po(p) 
has to be assumed to be constant (this assumption is related to the Laplace principle of indifference [(72"|). 

(3) The final step of the Bayesian reconstruction is based on the well known Bayes rule p(x\y)p(y) — p(x\ y) — 
p(y\x)p(x) , with the help of which we find the conditional probability p(p\0, Xi) on the state space fi: 

P (p\6,x i )= "f:Y P l » ( 8 - 3 ) 

J n p(0,Xi,p)dn 



from which the reconstructed density operator can be obtained [see Eq.(8.4)1. 

In the case of the repeated TV-trial measurement, the reconstruction scheme consists of an iterative utilization of the 
three-step procedure as described above. After the N-th measurement we use as an input for the prior distribution the 
conditional probability distribution given by the output of the (N — l)-st measurement. However, we can equivalently 
define the TV-trial measurement conditional probability p({ } N \p) — ]X=i P(Oi, Aj \p) [the so-called likelihood function, 
which is also denoted as C(p) ] and applying the three-step procedure just once to obtain the reconstructed density 
operator 

M r ; x _ JqKpK }N)pdn 

P({ }N) - — r — pr, j . , , (8.4) 
JnP{P\{ }N)dn 



where p in the r.h.s. of Eq.(B.4) is a properly parameterized density operator in the state space £1. We note that 



in general, the reconstructed density operator (8.4) corresponds to a mixed state inspite of the fact that an a priori 



55 



assumption is that the system is in a pure state. This deviation from the purity (let say expressed in terms of the 
von Neumann entropy) may serve as a measure of fidelity of the estimatimation procedure^ 

At this point we should mention one essential problem in the Bayesian reconstruction scheme, which is the deter- 
mination of the integration measure . The integration measure has to be invariant under unitary transformations 
in the space f2. This requirement uniquely determines the form of the measure. However, this is no longer valid when 
f2 is considered to be a space of mixed states formed by all convex combinations of elements of the original pure state 
space Q. Although the Bayesian procedure itself does not require any special conditions imposed on the space Q, the 
ambiguity in determination of the integration measure is the main obstacle in generalization of the Bayesian inference 
scheme for reconstruction of a priori impure quantum states. We will show in later in this Section that this problem 
can be solved with the help of a purification ansatz. We will also discuss in detail how to apply the quantum Bayesian 
inference for a reconstruction of states of a spin-1/2 when just a finite number of elements of an ensemble have been 
measured. Before we do this we will analyze the limit of large number of measurements. 

A. Bayesian inference in limit of infinite number of measurements 

The explicit evaluation of an a posteriori estimation of the density operator p{ }n is significantly limited by 
technical difficulties when integration over parametric space is performed [see Eq.( |B.4| )]. Even for the simplest quantum 
systems and for a relatively small number of measurements, the reconstruction procedure can present technically 
insurmountable problems. 

On the other hand let us assume that the number of measurements of observables Oi approaches infinity (i.e. 
N — ► oo). It is clear that in this case the mean values of all projectors (P x . associated with the observables Oi 
are precisely known (measured): i.e. 

(4,a>=4 (8-5) 



where ■ a] = 1. In this case the integral in the right-hand side of Eq^^) can be significantly simplified with the 
help of the following lemma: 

Lemma: 

Let us define the integral expression 

/■i rv2 ryn-i 

I{a 1 ,...,a n -x)= / dxi I dx 2 ... dx n -% F(x%, . . . ,x n -i\ ax, . . . , a n _i). (8.6) 

Jo Jo Jo 

where 

F{xx,...,x n -x\ax,...,a n -x) = — x x x x 2 2 ■■■x n _ 1 (1 - Xx ■ ■ ■ - x n -x) n ■ (8.7) 

t> 

and aj satisfy condition a, = 1. The integration boundaries are given by relations: 

k-l 

y k = 1-J2 x f> k = 2, . . . ,n — 1. (8.8) 

3=1 

and B equals to the product of Beta functions B(x,y): 

B = B(a n + 1, a n -x + l)B(a„ + a n _i + 1, a„_ 2 + 2) . . . B(a n + a n _i . . . a 2 + 1, ax + n - 1). (8.9) 

i. The function F(xx, ■ ■ ■ ,x n -i\ ax, ■ ■ ■ ,a„_i) in the integral ( |3.6| ) is a normalized probability distribution in the 
(n — l)-dimensional volume given by integration boundaries. 

ii. For N — > oo, this probability distribution has the following properties: 

(a*)-»a< (a£}-»a? * = 1, 2, 3, . . . , n - 1, (8.10) 



4 We note that Hradil |Q has recently proposed another statistical quantum-state-reconstruction method related to the 
Bayesian scheme considered in this Section. His method is based on the maximization of the likelihood function C(p). 
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i.e., this probability density tends to the product of delta functions: 

, a n -i) = 5{xi - a\)5(x2 - a 2 ) . . . S(x n -x - a„_i). 



lim F(xi, . . . ,x n -i\ ai, 

N—>oo 



Proof: 

Statement i. can be derived by the successive application of the equation [see for example j74j, Eqs.(3.191)] 

v u ~ l {u - X y- l dx = u M+, '- 1 B(ji, v). 



.11) 



3.12) 



Statement ii. can be obtained as a result of straightforward calculation of limits of certain expressions containing 
Beta functions with integer-number arguments. In our calculations we have used the identity 



B(n + 1, m) 



B(n,m) n + m 
which is satisfied by Beta functions with integer-number arguments. 



(8.13) 



B. Conditional density distribution 



Let us start with the expression for conditional probability distribution p({ } N \p) for the A-trial measurement of a 
set of observables Oi . If we assume that the number of measurements of each observable Oi goes to infinity then we 
can write: 



lim 



rii 

n[n 



Tr P 



3 = 1 



a'N 



(8.14) 



The first product on the right-hand side (r.h.s.) of Eq.(8.14) is associated with each measured observable Oi on a 
given observation level. The second product runs over eigen values rii of each observable O,;. 

In what follows we formally rewrite the r.h.s. of Eq. 
following integration 



14): we insert in it a set of (5-functions and we perform the 




dxi 



Tr 



Tr 



(V.A')]nc 



' N 



(1 



-l) C 



N 



.15) 



In Eq.(8.15) we perform an integration over a volume determined by the integration boundaries y\ [see Eq.(] 
due to the condition ^(^x 6 ;P) ~ ^' there is no need to perform integration from — oo to oo. 

At this point we utilize ou r Le mma. To be specific, firstly we separate in Eq.(B.15) the te rm, which corresponds to 
the function I given by Eq.([B~6|). Then we replace this term by its limit expression ( 3.11 ), After a straightforward 
integration over va riab les x l j we finally obtain an explicit expression for the conditional probability p(p\{ } N ^oc.) which 



we insert into Eq.(8.4), from which we obtain the expression for an a posteriori estimation of the density operator 
p({ }n^oo) on the given observation level: 



i r \n%—l 

pa }^j= 77 jji n*K* 



pdf, 



(8.16) 



Here A/" is a normalization constant determined by the condition Tr [p({ Ijy^^)] = 1. 

The interpretation of Eq.(8.16) is straightforward. The reconstructed density operator is equal to t he sum of 
equally- weighted pure-state density operators on the manifold fi, which satisfy t he co nditions given by Eq.( |S.5| ) [these 
con dition s are guaranteed by the presence of the 5-functions in the r.h.s. of Eq.( |8.16 )]. In terms of statistical physics 
Eq. fl3.16 ) can be interpreted as an averaging over the generalized microcanonical ensemble of those pure states which 
satisfy the conditions on the mean values of the measured observables. Consequently, Eq.( 8.16| ) represents the principle 
of the "maximum entropy" associated with the generalized microcanonical ensemble which fulfills the constraint (|3.5|) . 
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C. Bayesian reconstruction of impure states 



In classical statistical physics a mixed state is interpreted as a statistical average over an ensemble in which any 
individual realizations is in a pure state. This is also true in quantum physics, but here a mixture can also be 
interpreted as a state of a quantum system, which can not be completely described in terms of its own Hilbert space. 
That is the system under consideration is a nontrivial part of a larger quantum system. When we say nontrivial, we 
mean that the system under consideration is quantum-mechanically entangled 0] (see also ]7^| ) with the other parts 
of the composite system. Due to the lack of information about other parts of this complex system, the description of 
the subsystem is possible only in terms of mixtures. 

Let assume that the quantum system P is entangled with another quantum system R (a reservoir). Let us assume 
that the composed system S {S = P x R) itself is in a pure state |^). The density operator pp of the subsystem P 
is then obtained via tracing over the reservoir degrees of freedom: 

p P = Tr* \ps] ; fo = |*)<*|. (8.17) 

Once the system S is in a pure state, then we can determine an invariant integration measure on the state space of 
the composite system S and then we can safely apply the Bayesian reconstruction scheme as described in Section III. 
The reconstruction itself is based only on data associated with measurements performed on the system P. When the 
density operator ps is a posteriori estimated, then by tracing over the reservoir degrees of freedom, we obtain the a 
posteriori estimated density operator pp for the system P (with no a priori constraint on the purity of the state of 
the system P) . These arguments are intrinsically related to the "purification" ansatz as proposed by Uhlmann |76| . 

To make our reconstruction scheme for impure states consistent, we have to chose the reservoir R uniquely. This 
can be done with the help of the Schmidt theorem (see Ref . p|,|f7| ) from which it follows that if the composite system 
S is in a pure state | <J/) then its state vector can be written in the form: 



M 

I*) = ^c l \a t ) P <g> \/3i) R , (8.18) 



where \oci) P and \/3i) R are elements from two specific orthonormalized bases associated with the subsystems P and 
i?, respectively, and Cj are appropriate c ompl ex numbers satisfying the normalization condition ^ jc^j 2 = 1. The 



maximal index of summation (M) in Eq.(8.18) is given by the dimensionality of the Hilbert space of the system P. 
In other words, when we apply the Bayesian method to the case of impure states of M-level system, it is sufficient 
to "couple" this system to an Af -dimensional "reservoir". In this case the dimensionality of the Hilbert space of 
the composite system is 2M. Using the standard techniques (see Appendix B) we can then evaluate the invariant 
integration measure on the manifold of pure states and we can apply the quantum Bayesian inference as discussed 
above. We stress once again, that using the purification procedure we have determined the invariant integration 
measure on the space of pure states of the composite system. 

Concluding this Section we note that there also exists another approach to the problem of the integration measure on 
the space of impure states. Namely, Braunstein and Caves J78| used statistical distinguishability between neighboring 
quantum states to define the Bures metric [f79| on the space of all (pure and mixed) states of the original system S 
(see also recent work by Slater p0|). The two approaches differ conceptually in understanding what is an impure 
quantum- mechanical state. That is, in our approach we assume that impurity results as a consequence of the fact 
that the system under consideration is entangled with some other system. The other approach accepts the possibility 
that an isolated quantum system can be in a statistically mixed state (we will not discuss consequences of these two 
conceptually different approaches here, but this problem definitely deserves due attention). 



IX. RECONSTRUCTION OF SPIN STATES VIA BAYESIAN INFERENCE 

We start this section with the Bayesian reconstruction of spin-1/2 states on various observation levels. That is, we 
investigate how the best a posteriori estimation of the density operator of the spin-1/2 system based on an incomplete 
set of data (in this case the exact mean values of the spin observables are not available) can be obtained. We have 
already stressed the fact that the Bayesian inference scheme as introduced by Jones |12j is suitable only for pure 
states. This means that the completely reconstructed density operator has to fulfill the purity condition 

l(^)| 2 + l(^>| 2 + |(a 2 )| 2 = l. (9.1) 

We start our example with a definition of the parametric state space associated with the spin-1/2. The rigorous 
way to determine this parametric state space Q is based on the diffcomorphism between Q and the quotient space 
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su ^ \u(n—i) 7 where n is the dimensionality of the Hilbert space of the measured quantum system. In a particular 
case of the spin-1/2 we work with the commutative group U(l) and the construction of £1 is very simple. The space 
f2 can be mapped on t o the Poincare sphere and the parameterized density operator (i.e. the point on the Poincare 



sphere) is given by Eq^T^). The topology of the Poincare sphere determines also the integration measure for which 
we have d n = sin 9 ddd(j) (for more details see Appendix B). 

The observables associated with the spin-1/2 are spin projections for three orthogonal directions represented by 
Hermitian operators §j = frj/2. These observables have spectra equal to In what follows we distinguish between 

these two possible measurement results by the sign, i.e. s = ±1. The projectors P s ,$i on to the corresponding 
eigenvectors are 

Ps,h = 1 + 2 Sg ' ; i = x,y,z, (9.2) 
and the conditional probabilities associated with this kind of measurement can be written as 

1 — |~ S T ' 

p(s, Si\p{8, <t>)) = — i = x,y, z, (9.3) 



where we use the parameterization p(6,<p) — (1 + fa)/ 2 [see Eq.( |7.2| )]. Now using the procedure described in Sec- 
tion VIII , we can construct an a posteriori estimation of the density operator p({ } n) based on a given sequence of 



measurement outcomes on different observation levels. 



A. Estimation based on results of fictitious measurements 



In Table VIII we present results of an a posteriori estimation of density operators based on data obtained from 
"experiments" performed with three Stern-Gerlach devices oriented along the axes x, y, and z. We first discuss in 
detail reconstruction of a single spin-1/2 state under the a priori assumption that the system is in a pure state. 
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TABLE VIII. Results of a posterior Bayesian estimation of density operators of the spin-1/2 are presented for two different 
cases: (1) when it is a priori assumed that the spin is in a pure state and (2) when no a priori constraint on the state is 
imposed. In this second case the generalized Bayesian scheme has been applied. We also present values of von Neumann entropy 
[see Eq.(6.13)] associated with the given estimated density operator. In the case of a reconstruction of pure states, the value of 
the von Neumann entropy reflects the fidelity of the estimation. 
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1. Observation level = {s z } 



The first five lines in Table VIII describe results of a fictitious measurement of the spin component s z and the 
corresponding estimated density operators. In particular, let us assume that just one detection event (spin "up" , i.e. 
t) is registered in the given Stern-Gerlach apparatus (associated with the meas urem ent of s z ). Taking into account 
the parameterization of the single spin-1/2 densit y op erator expressed by Eq.(7.2) we find for the corresponding 
conditional probability distribution p(s, §i\p(9, 4>)) ( P-3[ ) the expression 



p{s,Si\p(e,4>)) 
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COS ( 



(9.4) 



Using Eq.(p.4|) we can express the estimated density operator based on the registration of just one result (spin "up") 
as 
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8^ J 



sin <I9 I d(f>(l + cos6>)(l + sin 9 cos 4><j x + sin 9 sin (f>& y + cos9a z ) = — (l + 



(9.5) 



We stress that we started our estimation procedure with an a priori assumption that the measured system is in a pure 
state, for which the vonNeumann entropy S (7.4) has to be equal to zero. But the estim ated density operator (9.5) 
describes a statistical mixture with the vonNeumann entropy S ~ 0.637 (see Table VIII). There is no contradiction 
here. In the reconstruction of pure states, a nonzero value of the vonNeumann entropy of the estimated density 
operator reflects the fidelity with which the reconstruction is performed. That is, before any measurement is performed, 
the "estimated" density operator is p = 1/2, for which the vonNeumann entropy takes the maximal value S = In 2 ~ 
0.693. As soon as the first measurement is performed, some information about the state of the system is acquired, 
which is reflected by the decrease of the entropy and a better estimation of the density operator. The estimated density 
operator is expressed as a statistical mixture because it is equal to a specifically weighted sum of a set of pure states [see 
the reconstruction formula (B.4)] which also reflects our incomplete knowledge about the state of the measured system. 
Obvio usly, the more measurements we perform, the better the estimation can be performed (compare lines 2-5 in 
Table VIII ). Nevertheless, we have to stress that the vonNeumann entropy is not a monotonically decreasing function 
of a number of measurements. To be specific, in the case when just a small number of measurements is performed, 
the estimation is very sensitive with respect to the outcome of any additional measurement. Comparing the lines 
2 and 3 in Table VIII, we see that the entropy "locally" increases in spite of the fact that more measurements are 
performed. Nevertheless, in the limit of large number of measurements, the entropy approaches its minimum possible 
value associated with a given measurement. Providing the quorum of observables is measured, the entropy tends to 
zero and the state is completely reconstructed. 

In general, increasing the number of measurements improves the a posterior estimation of the density operator on 
the given observation level (see lines 2-5 in Table VIII). Using the general results of Section VIIl we can evaluate 



the a posteriori estimation of the density operator of the spin-1/2 system on the observation level O]? in the limit of 
infinite number of measurements of the spin component s z . We note, that in this case, when observable has only two 
eigenvalues, the information obtained in the spectral distributio n (|S.5D is equivalently given only by the mean value 
of this observable. Once we know the spectral distribution Eq. ( |S . 5[ )~corresponding to the measurement of the spin 
projection s z of single spin-1/2, then with the help of Eq.(B.16) we can express the reconstructed density operator as 
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p = — : / d(f> sin9d9 5({a z ) — cos#)(l + sin# cos 4>a x + sin sine/) a y + cos# a z ), 
■A/ Jo Jo 



(9.6) 



where Af is the normalization constant such that Trp = 1. Integration over the variable (f> in Eq.(9.6) cancels all terms 
in front of the operators a x and a y and we obtain 



p = — ■. I sin9d9 5((ct z ) — cos#)(l + cos#cr 2 ). 

A Jo 



(9.7) 



The right hand side of this equation suggests a simple geometrical interpretati on o f the quantum Bayesian inference 
in the limit of infinite number of measurements. Namely, the density operator (9.7) can be understood as an equally- 
weighted average of all pure states with the same (i.e., measured) mean value of the operator s z . T hese states are 
represented as points on a circle on the Poincare sphere. When we perform integration over 9 in Eq.(9.7) we obtain 
the final expression 



(9.8) 
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for the density operator on the given observation level. Formally this is the same density operator as that reconstructed 
with the help of the Jaynes principle [see Tab. 0. But there is a difference: the formula (9.8) is obtained as a result of 



averaging of the generalized microcanonical ensemble of pure states, while the reconstruction via the MaxEnt principle 
is based on an averaging over the generalized grand canonical ensemble of all states. The two reconstruction schemes 
differ by the a priori assumptions about the possible states of the measured system. As we will see later, these 
different assumptions result in different estimations (see below). 



2. Observation level Og^ = {s z ,s x } 



The results of a numerical reconstruction of the dens ity operator of the spin-1/2 based on the measurement of two 
spin components s z and s x are presented in Table VIII (lines 6-9). The lines 1-4 and 6-9 describe estimations based 
on the same data for the s z measurement, but they differ in the data for the s x measurement. That is, the lines 1-4 
describe the situation for which no results for s x are available, while lines 6-9 describe the situation with specific 
outcomes for the s x measurements. Comparing these two cases (i.e., if we compare the values of the von Neumann 
entropy for pairs of lines {x, x+5}; x — 1, 2, 3, 4) we see that any measurement performed on the additional observable 
(s x ) can only improve our estimation based on the measurement of the original observable (s z ). 

In the limit of infinite number of measurements, when we have information about the spectral distribution corre- 
sponding to measurement of spin projections s x , s z the particular form of Eq.(8.16) reads 
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p = -r? d<f> sin9d9 5 ((a z ) — cos 9)5 ((& x ) — sin 9 cos 
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cos9a z ). 



(9.9) 



As seen from the right-hand side of Eq.(9.9) in this case the reconstructed density operator is represented by an 
equally weighted sum of points given by an intersection of two circles lying on the Poincare sphere. These two circles 
are specified by the two equations (a z ) — cos 9 and (a x ) = sin 9 cos <f>. 
With the help of the identity 
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we can perform the integration over <f> in Eq.(|9.9|) and obtain 
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(9.10) 



(9.11) 



The integration boundaries C on the right-hand side of Eq.(9.11) are defined as 

£ ■= < 9 < 7T and | sin0| > |(<7 X )|. 



(9.12) 



The sum on the right-hand side of Eq.(9.11) refers to two values of the parameter tpo which fulfill the condition 
cos 4>o = {o' x )/sm9. We note that the function in front of the operator <j y disappears due to the fact that it is 
proportional to sin0o/| sin</>o|, which is an odd function of <j)$. After we perform the integration over 9 we obtain 



p = ^ (l + (vx)<t x + (<5-z)o>) 



(9.13) 



What we see again is that in the limit of a large number of measurements the Bayesian inference formally gives us 
the same result as the Jaynes principle of maximum entropy [see Tab. 0. 



3. Observation level 0^' = {s z , s x , s y } 

. level Og leads us to the complet< 
comp onent s s x ,s y and s z of the spin-1/2 are measured. Results of the numerical reconstruction are presented in 



Further extension of the observation level Og leads us to the complete observation level, when all three spin 



Table VIII (lines 10-13). Now we compare the a posteriori estimation of density operators based on data presented in 
lines 6-9. The "experimental data" in line 10 are equal to those presented in line 6 except that now some additional 
knowledge concerning the spin component s y is available. We note that this additional information about s y improves 
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our estimation of t he d ensity operator which is clearly seen when we compare values of the von Neumann entropy 
presented in Table VIII . 

Providing that we have information concerning the spectral distribution associated with the measurement of a 
complete set (i.e. the quorum) of operators s x , s y , s z (i.e., after an infinite number of measurement s of t he three spin 
components have been performed), then we can express the estimated density operator as [see Eq.(S.16)] 
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X (1 + sin 9 cos 4>a x + sin 9 sin <f>a y + cos 9o z 



(9.14) 



The integral on the right-hand-side of Eq. ( 3.14 ) can only be performed if the pu rity c ondition ( j9~l| ) is fulfilled, otherwise 
it simply does not exist. When the purity condition is fulfilled then from Eq.(9.14) we obtain 



1 



(9.15) 



Here we can again utilize a si mple geometrical interpretation of the limit formula (9.14) for the Bayes inference. The 
three 5-functions in Eq.( 9.14 ) correspond to three specific orbits (circles) on the Poincare sphere each of which is 
associated with a set of pure states which posses the measured value of a given observable s$. The reconstructed 
density operator then describes a point on the Poincare sphere which coincides with an intersection of these three 
orbits. Consequently, if the three orbits have no intersection the reconstruction scheme fails, because there does not 
exist a pure state with the given mean values of the measured observables. 



We illustrate this failure of the Bayesian inference scheme in lines 14-17 of Table |VII|. Here we present a numerical 
simulation of the measurement in which all three observables are measured. It is assumed that the spin - 1/2 is in the 
state with (a z ) — 1/2 and (a x ) = (a y ) = 0, which apparently does not fulfill the purity condition (9.1). For a given 
set of measurement outcomes (line 14) the Bayesian inference scheme provides us with an a posteriori estimation 
such that (<j 2 ) = 101/161 which is above the expected mean value which is equal to 1/2. Moreover if we increase 
the number of measurements (lines 15-17) the a posteriori estimation deviates more and more from what would be 
a correct estimation (i.e., results presented in lines 14-17 correspond to the following sequence of mean values of a z : 
0.481; 0.375; 0.345; 0.332) but simultaneously the vonNeumann entropy S decreases, which should indicate that our 
estimation is better and better. This clearly illustrates the intrinsic conflict in the estimation procedure. 

The reason for this contradiction lies in the a priori assumption about the purity of the reconstructed state, i.e. the 
mean values of the spin components do not fulfill the condition (9.1) and so the Bayesian method cannot be applied 
safely in the pres ent c ase. The larger the number of measurement the more clearly the inconsistency is seen and, as 
follows from Eq. ( |9.14 ) , in the limit of infinite number of measurements the Bayesian method fails completely. On the 
other hand the Jaynes method can be applied safely in this case. The point is that this method is not based on an a 
priori assumption about the purity of the reconstructed state. The Jaynes principle is associated with maximization 
of entropy on the generalized grand canonical ensemble, which means that all states (pure and impure) are taken into 
account. 

In the present example the discrepancy between the a posteriori estimations of density operators based on the two 
different schemes has appeared only on the complete observation level. For more complex quantum-mechanical systems 
the difference between the density operator reconstructed with the help of the Jaynes principle of maximum entropy 
and the density operator obtained via the Bayesian inference scheme may differ even on incomplete observation levels. 
To see this we present in the following sections an example of reconstruction of density operators describing states of 
two spins- 1/2. 



B. Quantum Bayesian inference of states of two spins- 1/2 



In order to apply the general formalism of quantum Bayesian inference as described in Section VIII we have to 
properly parameterize the state space of the quantum system under consideration. Once this is done we have to 
find the invariant integ ratio n measure oIq associated with the state space and only then can we effectively use the 
reconstruction formula (8.3). We start this section with a description of how the state space of two spins-1/2 has to 
be parameterized and we show how the integration measure can be found. 
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1. Parameterization of two-spins- 1/2 state space 



One way to determine the state space O of a given quantum-mechanical system is via a diffeomorphism Q = 
\u(n-i) ■ This directly provides us with information about the dimensionality of Q, which is (dimsu(n) — 
dimu(n-i)) = 2n — 2. This means that in our case of two spins-1/2 which are prepared in a pure state we need 
6 coordinates which parameterize il {n = 4) . Unfortunately, it is not very convenient to determine the state space via 
the given diffeomorphism because then we have to work with noncommutative groups. 

It is much simpler to parameterize the state space Q utilizing the idea of the Schmidt decomposition |l];[77j . In this 
case we can represent any pure state \^} describing two spins-1/2 as: 



|*>=A|Ti>®|Ta>+£|ii><8|ia>, 



(9.16) 



where ||j),|Tj)i are two general orthonormalized bases in H 2 and A, B are two complex numbers satisfying the 
condition \A\ 2 + \B\ 2 = 1. The corresponding density operator of a pure state in fl then reads 
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The projectors |TjXTjl an d |J.jXijl (j = 1, 2) are given by (1 + r^S 3 ) and (1 — fdia 3 ), respectively [see Eq.(|9.2D], 
where r' 1 ' and r< 2 > are two arbitrary unity vectors. The operators ||j)(Tj| an d their Hermitian conjugates |Tj)(lj| are 
determined as 



(9.18) 



from which the relation 



(9.19) 



follows. Here the vectors fco> are two arbitrarily chosen unity vectors which satisfy the condition fcO' _L rti) , and 
101 are equal to vector products £Cf) = r (j > x fco). A particular choice of vectors kj is not important because phase 
factors e l ^ j [ if>j G (0, 27r)] rotate them along all possible directions. We also note that the phase factors can 
be always incorporated in the phase tp of a complex number AB* . Using the parameterization \A\ = cos(a/2) and 
\B\ = sin(a/2) we can parameterize p as: 
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sin a cos 



4 4 
where -0, 0i, 02 £ (0, 27r); a, 0i, 02 € (0, 7r) and 
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few) — (sin^j, — cos0j, 0); 
Z (i > = (cos 0j cos 0j , cos 0j sin 0j , — sin 8j); 
r^) = (sin 0, cos 0, ■, sin 7 - sin0,-, cos 0,)- 



(9.21) 



Once we have parameterized the state space fl we can find the invariant integration measure do (see Appendix B) 
which reads 



dn = cos a sin a sin Q\ sin 02 dadipdtfii dB\ d(f>2 d0 2 



(9.22) 
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C. Quantum Bayesian inference of the state of two-spins-1/2 



To perform the Bayesian reconstruction of density operators of the two-spins-1/2 system we introduce a set of 
projectors associated with the observables 
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The corresponding conditional probabilities can be expressed as 
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where s is the sign of the measured eigenvalue. Here we comment briefly on the physical meaning of the projectors 



defined by Eq.(9.23). Namely, the single-particle projectors of the form P s _<i) are associated with a measurement of 



the spin component of the first particle in the i-direction (i = x, y, z). Obviously this spin component can have only 
two values, i.e., "up" (s = 1) and "down" (s = —1). 
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lricl 1 5787206 i. 0, 1 1 5542104 1 0, s. 1 7953979 - - 1 
4 [1 89 1 + 10656539 CT 2 W -I 1 10556539 1 W ^ 1 10556539 ° z 83 °* J 


0.889 


17. 


t 


t 


t 


|[1 ® 1 + \{o z ®\ + \®d z ) + \d z ® a z ) 


1.303 


18. 


t 4 


t 4 


t 4 


|[i ® i + ® 1 + 1 ® *.) + OT^ ® *.] 


0.883 


19. 


fi 


fi 


^6 


1 T1 1 1 3126849 f i ,0, 4 1 1 ,0, i ■( 1 3873174 2. k> - 1 
4L 1 ® 1 + 6044314 ® 1 + 1® <7aJ + 6044314^ ®^J 


0.988 


20. 


f°i 2 




Jsi2 


1 Tl r<>i 1 1 1372112265600137 /- „f , f « - \ , 1766795516375970 - i 1 
4 [1 IS 1 "I" 2298883X43280046 ^ W 1 1 1{ < Ja z) 1 2298883143280046 W 


0.831 



TABLE IX. Results of a posterior Bayesian estimation of density operators of the two-spin- 1/2 system. We also present 
explicit values of the von Neumann entropy associated with given measured data. 
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In Tables VIII and IX we will denote outcomes of the measurements "up" and "down" as f and J., respectively. The 
two-particle projectors P .<i) .(2) are associated with measurements of correlations between the two spin. Namely, if 

I i j 

s = 1, the two spins are correlated, which means that they both are registered in the same, yet unspecified, state (that 
is, both spi ns are registered either in the state | T1T2) or | jila))- 

In Tables VIII and IX we will denote this outcome of the measurement as |- On the contrary, if the particles are 
registered as anticorrelated, that is after the measurement they are in one of the two states | T1I2) or | J, 1T2) j then 
s = — 1. In Tables VIII and IX we will denote the outcome of this measureme nt for <7j ® &j as |. 

Now we can apply general rules of Bayesian inference presented in Section VIII. for a two-spins- 1/2 system. We 



will consider three specific incomplete observation levels and we will derive asymptotic expressions for the density 
operators in the limit of large number of measurements. We stress here that we assume the measured system to 
be prepared in a pure state. To be specific, let us suppose that the two spins are prepared in a state described by 
the state vector (obviously, this can be determined only after an infinite number of measurements on the complete 
observation level is performed) 



|*) = A|t)® IT) + b\d® li), 



(9.25) 



where | f) and ||) are eigenstates corresponding to the observable of the spin projection into the z-direction (i.e., 
(a z ® 1) = (1 ® a z ) = \A\ 2 — \B\ 2 and (a z ® a z ) = 1). When we ass ume t he coefficient s \A\ and B\ to be real, then 



we can rewrite the density operator associated with the state vector ( |9.25 ) in the form (9.20), i.e 



I® 1 



A 2 - B 2 



(<7z 



1 + 1 



AB . 



(9.26) 



with tp = 0, 4>\ = 7r/2, 6\ = 0, 4>2 — ft/ 1 }, 02 = and sin a/2 = A. In what follows we perform a posteriori estimation 
of the density operator based on incomplete data obtained from three different fictitious measurement sequences. 



1. Observation level = {s z r> ,s^} 
In the first sequence of measurements we reconstruct a density operator from data which refer to a measurement 



of the first spin-1/2 in the direction z, i.e., only the spin component s z 1) is measured (see lines 1-4 in Table IX). 
We see that if only one spin is measured, then the reconstructed two-spin density operator can be factorized, while, 
as expected, the state of the unmeasured spin is estimated as p — 1/2. Obviously, in this kind of measurement, 
correlations between the two spins cannot be revealed, i.e., the estimated value of a z ® a z is equal to zero. As in the 
case of the reconstruction of a single-spin- 1/2 state, the reconstructed density operators describe statistical mixtures 
and the corresponding von Neumann entropy is directly related to the fidelity of the reconstruction. The maximum 
value of the vonNeumann entropy is in the case of two-spins-1/2 equal to S = In 4 ~ 1.386. This entropy is associated 
with the "total" mixture of the two-spin-1/2 system and in our case it reflects a complete lack of information about 
the state of the measured system (i.e., we have no knowledge about the state before a measurement is performed). As 
soon as the first measurement is performed, we gain some knowledge about the state of the system and the entropy 
of the estimated density operator is smaller than In 4 (see line 1). 

Let us assume now that data from the measurement of the spin components and s' z of the first and the second 



particle (spin-1/2), are available. In Table IX (lines 5-8) we present results of a reconstruction procedure based on 
the given "measured" data. We see that though correlations between the two spins have not been measured directly 
our estimation procedure provides us with a nontrivial estimation for this observable (i.e., the density operator cannot 
be factorized). Obviously, this estimation is affected by the prior assumption about the purity of the reconstructed 
state. We see that with the increased number of detected spins the von Neumann entropy of the estimated density 
operator decreases (we note that it does not decrease monotonically as a function of the number of measurements). 

In the limit of large (infinite) number of measurements spectral distributions Eq. ( |3.5| ) associated with observables 
on a given incomplete observation level are precisely determined by the measured data. Using the parameterization 
introduced earlier in this section [see Eqs.(|9~20|) and ( |9~22|p4| )] we can write down the expression (|8.16|) for the 



Bayesian a posteriori estimation of the density operator in the limit of large number of measurements. After we 
perform some trivial integrations and when the substitution cos a = x, cos#i = y, cos #2 = z is performed we can 
write the reconstructed density operator as 



1 



x 2 dx I dy dz5((a ( z >) - xy)5((6r z 2) ) - xz)(l ® 1 + xya z ® 1 + xzl ® a z + yza z ® a z ), (9.27) 
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The right-hand side of Eq.( |9.27 ) can easily be integrated over the variables y and z so we can write 



P = JfJ dx f 1 ® 1 + (a*') ® 1 + <^ 2) > 1 ® a z + [CJz ' a z ® a z , . 



(9.28) 



where the integration is performed over the interval C 

C := {— 1, 1} and |x| > s max , 
with s maa; = max{ Ktri 1 ')!, \{<Jz ')| }■ After we perform the integration over the variable x we find 



(9.29) 



1 / / " C 1 ) \ / A ( 2 ) \ 

= - ( 1 ® i + (a z 1} ) a z ® i + (ct< 2) ) 1 $ ff z + K ° z )[az ' 6r z ® ct. 



(9.30) 



Comparing results presented in Tab. VII and (9.30) we see that on the observation level O A the quantum Bayesian 



inference and the Jaynes principle of maximum entropy provides us with the different a posteriori estimations of density 
operators. To be specific, the density operator obtained with the help of the MaxEnt principle can be expressed in a 
factorized form while the density operator /5 a cannot be factorized into a product of two density operators describing 
each spin separately [the only exception is when s max = 1]. 



2. Observation level Og 



(1) S(l)g(2) 



} 



Here we start our discussion with an assumption that only correlations between the particles are measured, while 
the state of the each individual particle after the measurement is unknown (see lines 9-12 in Table IX). In this case we 
are not able to make any nontrivial estimation for the mean values of the spin components of the individual particles. 
In order to have a better estimation wc have also to measure at least one of the spin components of the first or the 
second spin. 

Let us assume that the ^-component of the first spin and the correlation s z s z are measured. That is the z- 
component of the second spin s z is not directly observed. The question is what is the estimation of the density 
operator on this observation level and in particular, what is the estimation for the mean value of the observable s z . 
In Table |ix| (lines 13-16) we present numerical results for the a posteriori estimation of the density based on a finite 
set of "experimental" data. We see that the Bayesian scheme provides us with a nontrivial (i.e., nonzero) estimation of 
the mean value of s i z ) . But the question is whether in the limit of a large number of measurements this is equal to the 
mean value estimated with the help of the Jaynes principle of maximum entropy. The expression for the a posteriori 
Bayes estimation of the density operator in the limit of infinite number of measurements on the given observation 
level [for technicalities see Appendix C] reads 



1 



1®1 



° z i 



I - (1) - (2)x « 



(9.31) 



where s max = max { | (a z 1} )\,\ (a z 1) oi 2> ) | } . Here again the Bayesian a posteriori estimation ( |9.3l|) i s in general different 
from the estimation obtained with the help of the Jaynes MaxEnt principle [see /5b in Tab. [VIl[[ . We see that these 
two results coincide only when s ma x — 1< For in stance, if (a z ®a z ) = 1, then s max is equal to unity and the estimated 
density operators /5b and p given by Eq. ( |9.31 ) are equal and read 



HP 



i 



i 



(9.32) 



In the case when (a z ) = 1 the von Neumann entropy is equal to zero, i.e. 
structed, and is described by the state vector \^>) = | Tit2)- 



the measured state is completely recon- 



(2) 

3. Observation level O c 



{s< 1) ,s< 2) ,^ ) J< 2) } 



Finally, we will consider a measurement of both the spins projections s z 1} , s 
Results of an e stim ation of the density operator based on a sequence of data associated with this observation level are 
given in Tabic IX (lines 17-20). If an infinite number of measurements on the given observation level is performed 
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then we can evaluate the a posteriori density operator analogously to that of the previous example [see Appendix C] 
and after some algebra we find 



1 



dx I dz 



\f a + bz + cz 2 



[1 <g) i + (vi 1} )a z <g> 1 + xzl (g) a z + (d- z r> d- z 2> )a z 



(9.33) 



Due to the presence of the 5- function the integration over the parameter z on the right-hand side of Eq.(3.32) is 
straightforward and we obtain 



1 



dx 



[1 <g> 1 + {a { z ] )a z <g> 1 + (of >)i <g> a z + {a z 1) af ) )a z <g) a z 



N Jc" y / a + bz Q + cZq 

where zq = (a z 2) )/x. From Eq.( |9.34 ) we directly obtain the reconstructed density operator which reads 
p = ~ [i ® 1 + (v ( z 1} )v z <g> i + (oi 2) )l <8> <t z + {a' z 1) a' z ) )a z <g> a z ] . 



(9.34) 



(9.35) 



We see that on the present observation level the density operator ( 9.35 ) estimated via Bayesian inference is equal to 
the density operator pc estimated with Jaynes principle of maximum entropy [see Tab. VII]. Nevertheless, we have 
to remember that the estimation ( 9.3 5| ) based on quantum Bayesian inference is intrinsically related to an averaging 
over a generalized microcanonical ensemble of pure states. On the other hand, the MaxEnt-estimation is associated 
with averaging over the grand canonical ensemble. 



D. Reconstruction of impure spin states 



In this section we apply the purification ansatz as shown in Section VIII C for a reconstruction (estimation) of an 
impure state of a single spin-1/2. To do so, we apply the results of the previous section where we have discussed 
the Bayesian estimation of pure two-spins-1/2 states. In particular, in lines 1-4 of Table [X we present results of 



the estimation of a two-spin density operator based on "results" of measurements of the cr z -component of just one 
spin-1/2. We see that in this case the two-spin density operator can be written in a factorized form, p a {, = p a (8 ^1. 
In this case we can easily trace over the unm easur ed spin and we obtain the estimation for the density operator of 
the first spin (compare with lines 1-4 in Table VIII). This estimation is not based on the a priori purity assumption. 

Comparing results of two estimations which differ by the a priori assumption about the purity of the reconstructed 
state we can conclude the following: 

(1) In general, un der t he purity assumption the reconstruction procedure converges faster (simply compare the two 
columns in Table VIII ) to a particular result. This is easy to understand, because in the case when the purity of 
measured states is a priori assumed, the state space of all possible states is much smaller compared to the state space 
of all possible (pure and impure) states. 

(2) When the measured data are inconsistent with an a priori purity assumption, then estimations based on this 
assumption become incorrect. For instance, for the "measured" data presented in lines 14-17 of Table VIII we find 



that the estimated mean values of <r z diverge from the expected mean value 1/2 (i.e., this is the mean value of <x z 
when we detect in a sequence of AN measurements 3iV spins "up" and N spins "down"). As we have shown in 
Section IX A 3 in the limit N — > oo the reconstruction can completely fail when the purity condition is imposed. In 
the other hand, if it is a priori assumed that the measured state can be in a statistical mixture, then the Bayesian 
quantum inference provides us with estimations which in the limit N — > oo coincide with estimations based on the 
Jaynes principle of maximum entropy. 



1. Observation level = {si 1 '} 



Using the techniques which have been demonstrated in Section VIII we can express the estimated density operator 
on the gi ven o bservation level in the limit N — > oo as [see Eq.( 8.16 )|. We note that on the considered observation 
level, Eq.( [B.l(: ) contains many terms, which are odd functions of the corresponding integration variables. Therefore 
the integration over these parameters (02,4>2,'*P, ( t>i) is straightforward. Moreover, if we perform the trace over the 
"second" (reservoir) spin we can express the density operator of the spin-1/2 under consideration as 



1 



V dy 
1 Jo 



sinflidfli ^(((Tz 1 ') — ycos0i)(l + y cos6>i<r 2 ), 



(9.36) 



67 



where the variable a is substituted by y = cos a. When we perform integration over y we obtain the expression 

2 



_ _ ,Q 8111 Pi 

P ~ AT J c 'cos 2 X | cos 0i | 



(1 + 



with £ defined as 



£:={0, 7r} such that | cos 0i | > | (oi 



(9.37) 



(9.38) 



After we perform the integration over 0i we obtain the expression for the density operator identical to that obtained 
via the Jaynes principle of maximum entropy [see Tab. IvJ. 

2. Observation level = {si 1} , s' x 1} } 



In the limit of infinite number of measurements one can express the Bayesian estimation of the density operator of 
the spin-1/2 on the given observation level as (here the trace over the "reservoir" spin has already been performed): 

P=JfJ y 2d yj sin0id0iy d<f> x 6((a ( z 1} ) - y cos0i)<5((<5i 1) ) - y sin6 1 cos0i) 



x (l + ysin0i cos</>i<T;c + ysin0i smfaay + ycosdia z 
When we perform integration over the variable y we find 
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P = Tr I ^ I dQl — zll"' a 
■A/ J a J a cos 2 0i | cos 0i | 



<5«<7™) - tan 0i cos^of')) 



(9.39) 



x (l + (<t z 1> } tan 0i cos^iCTa; + (oi 1 ') tan 0i sin^iCT^ + (oi 1 ')^) 



The integration over the variable 4>i in the right-hand side of Eq.(9.4C) gives us 

2 



P = 



_L / d9l y 1 — 

M Jc" jri cos 2 0i|sin</>i j) | 



1 + + (v ( z } ) tan 0i shx^ } a y + (&™)& 



where the integration is performed over the interval 



(9.40) 



(9.41) 



£":={0,tt} such that Icos0i| > \{a^)\, and |tan0i| > | [ * ' . 

(a* )\ 



The sum in Eq.(9.41) is performed over two values of the variable <p\ which are equal to the two solutions of the 
equation 



(9.42) 



COS 01 



(of ) )tan0i' 



(9.43) 



Due to the fact that the term in front of the operator a y 1} is the odd function of (pf , we can straightforwardly perform 
in Eq.(9.41) the integration over 0i and we find the expression of the reconstructed density operator which again is 
exactly the same as if we perform the reconstruction with the help of the Jaynes principle [see Tab. 0. 
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Observation level = {§i r> , s^ 1 ' , sS, 1 ' } 



On the complete observation level, the expression for the Bayesian estimation of the density operator of the spin- 
1/2 in the limit of infinite number of measurements can be expressed as (here again we have already traced over the 
"reservoir" degrees of freedoms) [see Eq.(9.39)]: 



p± n 7T p Z7T 

p = J y 2 dyj sm9id9 1 J dfa S((d- Z 1} ) - y cos 9 1 )S((ct ( x 1) ) - y sm9 1 cos<f>i)S({a^) - ysm9 1 sin^ x ) 



x (l + ysin^! cos^ict^ + y sm9 1 sm^ia y + ycos9 1 a z 



Performing similar calculations as in the previous subsection we can rewrite Eq.(9.44) as 



P~ I 777 ^ W^Vtan^sin^a' 1 ')) 

J c cos v\ I sin (jyf | 



(9.44) 



(i + {a^)u x + (a^)ta,n9 1 sm^&y + (ai 1 ')^) 



1- 



(9.45) 



where £" and (j>i are defined by Eqs.(9.42) and (9.43), respectively. Now the integration over 9\ can be easily 
performed and for the density operator of the given spin-1/2 system we find 



^=2 [i + <s£V« 



(a ( z 1} )a z 



(9.46) 



where the mean values (a* 1 ) do not necessarily satisfy the purity condition (9.1). In other words the generalized 



Bayesian scheme provides us with a possibility of reconstruction of impure quantum-mechanical states and the results 
in the limit of infinite number of measurements are equal to those obtained with the help of the Jaynes principle of 
maximum entropy. Moreover, when the quorum of observables is measured, a complete reconstruction of the measured 
state is performed. 



X. OPTIMAL ESTIMATION OF QUANTUM STATES FROM FINITE ENSEMBLES 



In the previous section we have analyzed how quantum states can be reliably estimated from the data obtained in a 
given measurement performed on a finite ensemble of N identically prepared objects in an unknown pure quantum state 
described by a density operator p = \tp)(yj\. In this section we will address the question - What kind of measurement 
provides the best possible estimation of pi (see also |44 This leads to an important problem of the optimal 

state estimation with limited physical resources. It is a generic problem, common to many areas of quantum physics 
ranging from the ultra-precise quantum metrology to eavesdropping in quantum cryptography. 

Within a framework of an elementary group theory the problem of the state estimation can be reformulated as 
a more general problem of estimating an unknown unitary operation from a group of transformations acting on a 
given quantum system (i.e., the state estimation follows as a special case). Holevo |3]| has shown that this problem 
can be solved via the covariant measurement (CM) approach. Unfortunately, the covariant measurement corresponds 
to an infinite (i.e. consisting of infinite continuous set of operators) and therefore physically non-realizable positive 
operator- valued measure measurement (POVM). We note that from the logic of the CM it follows that if any optimal 
measurement (finite or infinite) does exist then using a simple formal construction one can generate from the original 
optimal measurement another measurement which is covariant and which, in the same time conserves optimality of the 
original solution. In the present Section we address the question how to find finite optimal generalized measurements 
if they exist. This is a fundamental question because only finite POVM schemes are experimentally realizable. We 
propose a universal algorithm how to look for these POVM schemes and we apply it explicitly in two physically 
interesting cases of the state estimation of N identically prepared two-level systems (qubits). 

In order to set up the scene, let us assume that state p is generated from a reference state po = \il>o)(ipo\ by a 
unitary operation £/(x) which is an element of a particular unitary finite dimensional representation of a compact Lie 
group G. Different x denote different points of the group (e.g., different angles of rotation in the case of the SU(2)) 
and we assume that all values of x are equally probable. 
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Our task is to design the most general POVM, mathematically described as a set {O r }^ =1 of positive Hermitian 
operators such that O r — 1 PUBOfl, which when applied to the combined system of all N copies provides us with the 
best possible estimation of p (and therefore also of U(x)). We quantify the quality of the state estimation in terms of 
the mean fidelity 



N times 



f = J2 dx ^fir U(.x)p U\x) ® . . . ® (7(x)p C/ t (x)]Tr[C/(x) / 5oi7 t (x) U r faU*\, (10.1) 

r J G 

which corresponds to a particular choice of a cost function |3C| ] used in a context of detection and estimation theory. 
The mean fidelity ( |lQ.l[ ) can be understood as follows: In order to assess how good a chosen measurement is we apply 
it many times simultaneously on all N particles each in state U (x) PqW (x) . The parameter x varies randomly and 
isotropicallyQ over all points of the group G during many runs of the measurement. 

For each result r of the measurement, i.e., for each operator O r , we prescribe the state |i/v) = U r \ipo) representing 
our guess (i.e., estimation) of the original state. The probability of the outcome r is equal to Tr[O r U (x) p U^ (x) (g> 
. . .0 J7(x)/5 t/^(x)], while the corresponding fidelity of our estimation is Tr[U(x) p W (x) U r poU}.}. This fidelity is 
then averaged over all possible outcomes and over many independent runs of the measurement with randomly and 
isotropically distribute d pa rameters x. We want to find the generalized measurement which maximizes the mean 



fidelity / given by Eq.( |lQ.l|) 



The combined system of N identically prepared reference states always remains within the totally symmetric sub- 
space of H k (g) H k (g) . . . H k , where H k is fc-dimensional Hilbert space of the reference state in which the corresponding 
unitary representation C/(x) acts. Thus the dimensionality d of the space in which we construct the POVM {O r } is 
d = ( 1 ). In this case the first trace in Eq.(lO.l) can be rewritten as 



f = J2[ ^[OrU N (x)n Q U N Hx)}Tr[U( X )p UHx)U r p Q Ul}d^ (10.2) 

r •> G 

where U N (x) is a new representation of the same group G; it is equivalent to the ./V-fold symmetrized direct product 
|Sl[ of the original representation U(x). Here U N (x) transforms the ( ^ + 1 fc_1 )-dimensional reference state denoted as 

do. 



We ca n insert the identity operator U^U^ into the first trace in Eq. ( 10.2 ) and, taking into account that in 
Eq.( 10.2| ) we integrate over whole the group G parameterized by x, we can substitute U N (x)U^ — > U N (x) and 



U(x)UJ. — * U(x). Now, using the linearity of the trace operation as well as the linearity of the representation of the 
group G (UpW is a linear adjoint representation) we r ewrite Eq.([10.2[) as 



where 



f = J2Tr[6 r U?FU^}, (10.3) 



U N (x)n U N Hx) Tr[U(x)poUi(x) po]dx, (10.4) 



G 



is a positive Hermitian operator. 

Let us now derive an upper bound on the mean fidelity. Taking into account positivity of the operator F (i.e., 
F = Ai|</>i)(</>i|; ^ 0) an d the completeness condition for POVM (i.e., J2 r ®r = 1) we obtain 

/ = J2Tr[6 r U?FU? t] =^AiTr[a^|<M 

r i r 

< X max J2 TrPr U?\<f>i){<l>i\UW] (10.5) 

i r 

= A mQX ^Tr[6 r U? iU^}=\ max Tr[l] = X max d. 



5 We note that this isotro py co ndition is equivalent to a "no a priori information" condition and is associated with the specific 
integration measure in Eq.(lO.l). This measure has to be invariant under the action of all unitary transformations on the state 
space of pure states (for details see Appendix B). 
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From Eq.( 10.5| ) it clearly follows that the upper bound can be achieved if and only if all operators O r forming the 



POVM satisfy the following conditions: 

i) Each O r is proportional to a suitably rotated (by some U^) projector on the eigenvector of F with the highest 
eigenvalue, i.e. for all O r there exists U 7 N , such that O r — c 2 U T N \4'max){4 l max\U r N ^ . This , or more precisely 
U r \tpo), is our guess associated with the result "r". 

ii) All c 2 are real and positive, to assure that all O r are positive operators. 

Hi) Finally, the operators O r have to satisfy the completeness criterion Y, r c r U r N \(f)max){4 1 mil. i 

As shown by Holevo in the case of the infinite POVM the condition in) is fulfilled for covariant measurements, 
providing the representation U N of the group G is irreducible (see Example A below). This statement follows from 
the Shur lemma. However, in the general case of reducible representation and specifically for finite realizable POVMs 
this argument cannot be used and we have to proceed differently. 

To find the solution of the problem we start with the following observation. Let us assume that we have some 
POVM {O r }^=i an d the corresponding guesses U 7 N which maximize the mean fidelity /. We can always construct 
another POVM with more elements which is also optimal. For example, let us consider a one-parametric subgroup 
U(4>) = cxp(iX<p) of our original group G and choose a basis {\m)}\ i n=1 in which the action of this subgroup is 
equivalent to multiplication by a factor e luJm ^ (i.e., the operator U(cj>) is diagonal in this basis and uj m are eigenvalues 
of the generator X). Then we take d points <p s (s = 1, . . . d) and generate from each original operator O r a set of d 
operators O rs = \jU N (4>s)O r U N1i (<fi s )- In this way we obtain a new set of (d- R) operators such that the mean fidelity 
for this new set of operators, / = J2 rs ^Pn U^ S FU^], is equal to the mean fidelity of the original POVM {O r } 
because we ascribe to each eventual result [r, s] a new guess U r . s = U(<fr s )U r . However, in order to guarantee that the 
new set of operators O rs is indeed a POVM we have to satisfy the completeness condition 

l = J2J2 d r S -J2J2 lu»(<p s )d r u N ^<p s ) = J2T, e " s( 7^" ) £ (^)„ m i™>h ( 10 - 6 ) 

s r s r s m,n r 

Let us notice that, by the appropriate choice of 4> s , the sum — ^ — — can always be made equal to S m . n providing 
all eigenvalues are non-degenerate^ (this is basically a discrete Fourier transform and we illustrate this point in detail 
in Example A). In this case, the conditions ( [10.6 ) for the off-diagonal terms in the basis |m) are trivially satisfied 



whereas the diagonal terms are equal to unity because the original POVM {O r } guarantees that ^2 r (O r ) mm = 1. 
Moreover, even if the original set of operators {O r } does not satisfy the full completeness condition and the conditions 
for the off-diagonal terms are not satisfied (i.e., these operators do not constitute a POVM) we can, using our extension 



ansatz, always construct a proper POVM {O rjS }. This proves that when we maximize the mean fidelity (10.3) it 
is enough to assume d diagonal conditions rather than the original complete set of d 2 constraints for diagonal and 
off-diagonal elements. 

Now we turn back to our original problem of how to construct the POVM which maximizes the mean fidelity. To 
do so we first express the operators O r in the form O r =c 2 . U^\^ r ) (^^1/^^ , where |^ r ) are general normalized states 
in the <i-dimensional space in which the operators O r act, and c 2 . ar e positive constants. This substitution is done 
without any loss of generality]] and it permits us to rewrite Eq.(10.3) so that the mean fidelity / does not explicitly 
depend on , i.e. 

f = J2 4Tr[\* r )(V r \F}. (10.7) 

r 

Obviously, the completeness condition ^2 r O r = 1 is now modified and it reads 

C^|* r )(* r |C^t = i. (10.8) 



6 In the case when the spectrum of the generator X is degenerate, i.e., for some m and n we have ujm — Lo n , then our algorithm 
is still valid, provided we increase a number of Lagr ange multipliers in Eq.(10.9) to account for off-diagonal elements L mn and 
L„m in the definition of the operator L in Eq.( |l0,9 ). 

7 The most general choice of O r would be O r = '^2 i c r ,i \^r,i) (^r,i\- However, from the point of view of optimality of the POVM 



these operators are always less effective than operators O r 
projectors. 



U^ ! \^' r ){^' r \U^'' which are proportional to one- dimensional 
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From our discussion above it follows that when maximizing the mean fidelity (10.7) it is enough to apply only d 
constraints J2 r c r \( m \UN,r\'^r)\ = 1 (here m = 1, ...,d) out of the d 2 constraints (10.8). Therefore to accomplish 
our task we solve a set Lagrange equations with d Lagrange multipliers L m . If we express L m as eigenvalues of the 
operator L = ^ m L m \m)(m\ then we obtain the final very compact set of equations determining the optimal POVM 



F - U^LU? 



|*r)=0, 



1. 



(10.9) 



From here it follows that \*f? r ) are determined as zero-eigenvalue eigenstates. More specifically, they are functions of d 
Lagrange multipliers {£ m }m = i and R vectors {x r }^ =1 [where x r determine U r as U r — U(x r )]. These free parameters 
are in turn related via R conditions Det[(_F — U N *LU N )] = 0. The mean fidelity now is equal to TrL. At this stage we 
solve a system of d linear equations [see the second formula in Eq.( 10.9 )1 for R unknown parameters c 2 . All solutions 
for c 2 parametrically depend on L m and x r which are specified above. We note that the number of free parameters in 
our problem depends on R which has not been specified yet. We choose R such that there are enough free parameters 
so that the mean fidelity is maximized and simultaneously all c 2 are positive. This freedom in the choice of the value 
of R also reflects the fact that there is an infinite number of equivalent (i.e., with the same value of the mean fidelity) 
optimal POVMs. The whole algorithm is completed by finding cf> s from Eq.( 10.6 ) which explicitly determine the finite 
optimal POVM {6<J. 



A. Optimal reconstruction of spin-1/2 states 



Suppose we have N identical copies of spin 1/2 all prepared in the same but unknown pure quantum state. If we 
chose the group G to be U(2), i.e. the complete unitary group transforming a two-level quantum system, we can 
straightforwardly apply the optimal estimation scheme as described above. To be more precise, due to the fact that 
there exist elements of the group U(2) for which the reference state is the fixed point (i.e., it is insensitive to its action) 
we have to work only with the coset space su ^\u(n~i) [0- In the present case this is a subset of the SU(2) group 
parameterized by two Euler angles 8, ijj (the third Euler angle x IS fixed and equal to zero). This subset is isomorphic 
to the Poincare sphere. 

The unitary representation U is now the representation (i) (we use a standard classification of 5/7(2) represen- 
tations, where (J) is the spin number). Its TV-fold symmetrized direct product (we denote this representation as 
U N ) is the representation classified as (-rr) (which transforms a spin-A r /2 particle). Choosing the standard basis 



\j, m) with m 
D* (M,0) = 



-j, . . . j in which the coordinate expression for U(8,ip) corresponds to standard rotation matrices 

-imip 



(8) 1 82 1, we obtain the matrix expression for the operator F 



± m.n — 







II 



* sm(8)d8 
8^ 



(l + cos0)D= „( 



D~ 



N/2 + m+l 
(N + 2)(N + 1)' 



(10.10) 



When we insert this operator in the Eq.(10.5) we immediately find the upper bound on the mean fidelity to be equal 

to 

10 N+2 . 



This is the main result of the paper by Massar and Popescu 44 who noted that this upper bound can be attained 
using the special POVM which consists of an infinite continuous set of operators proportional to isotropically rotated 
projector |y 5 y|- This result is closely related to the covariant measurements of Holevo [Bl[ . 

However, our aim is to construct an optimal and fini te PO VM. To do so, we have to find a finite set of pairs of 
angles {(8 r , Vv)} such that the completeness conditions (10.8) which now take the form 



E 



c 2 e -i Mm -n) d l 



_{8 r ) d*„{8 r ) =8 V 



(10.11) 



are fulfilled. Following our general scheme we first satisfy the completeness conditions (10.11) for diagonal terms 
[compare with Eq.(10.9)] 



E< 



1; 



m= -N/2,... N/2. 



(10.12) 
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s=0,l,...N 




r=0,l,..N 



input 



output 
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FIG. 14. The visualization ol the optimal POV M for measurement of a quantum state of the spin-1/2 from iV identical 
copies as provided by t he s olution in Section XA. On the right the schematic picture of an quantum network (for more 



information see Section XC) as a physical realization of the measuring apparatus which embodies the optimal solution from 



the left part of the picture. 



To satisfy these completeness conditions we choose N + 1 angles 9 r to be equidistantly distrib uted in the (0,7r) 
(obviously, there are many other choices which may suite the purpose - see discussion bel ow Eq .( |10.9| )). Then we 
solve the system of linear equations for N + 1 variables c r . For this choice of r the system ( 10.1 2[ ) has non-negative 
solutions. Finally we satisfy the off-diagonal conditions by choosing N + 1 angles i/> a = jffi for each 8 r . In this case 

IvTT 2~2^=o e l ^ sV = S y ,o for all y = —N/2, . . . N/2 and the off-diagonal conditions are satisfied straightforwardly. This 
concludes the construction of the optimal and finite POVM for the spin-1/2 state estimation. In Fig. |lj we present a 
schematic description of the optimal POVM performed on spin-1/2, while in Fig. [l5| the mean fidelity / as a function 
of number N of measured spins (initially prepared in the same state) is presented. 
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FIG. 15. The mean fidelity of the estimation of a quantum state of the spin-1/2 (lower line) and of a phase shift (upper 
line) based on the optimal POVM (as discussed in this Section) performed on iV identical copies of quantum objects under 
consideration. As expected, for TV large enough the fidelity in both cases becomes equal to unity. 



B. Optimal estimation of phase shifts 

Consider a system of N effectively two level atoms (qubits), all initially prepared in the reference state 771 (|0) + |1)) 
by applying so called ^ pulse to initially deexcited atoms. Then the atoms undergo the free evolution effectively 
described by the U(l) group, i.e. the state of the single qubit evolves as -75 (|0) -I- exp{iip(t)}\l)). Our task is to 
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find a measurement which provides the optimal estimation of the phase ip{t) of the U(l) rotation which carries the 
information about the interaction parameters. 

In the standard classification of representations of the E7(l) group the single isolated qubit is described by the direct 
sum of two one-dimensional representations U = (0) ffi (1). The representation U transforming entire system of N 
qubits is then equal to the direct sum of representations of the form (0) © (1) . . . (N). This acts in the N + 1 
dimensional space spanned by basis vectors |m), m = 0, 1, . . .N. In this basis matrix elements F m ^ n of the operator 



F given by Eq.(10.4) take the form 



± m.n — 



!?r dip \J (iV-m)(iV-n) 



2- 



2 N+1 



itp(n— in ' 



(l+cos^)= V 2N+2 (2S„ 



+ Sm,n+1 + <$m+l,n) • 



(10.13) 



The upper bound on the fidelity Eq. (10.5 ) is now too conservative to be of any use (greater than unity). We can, 
however, solve the system of Eqs. (10.9) which in this particular case of the commutative group reads 



F — L m =0: 



\(m\V)\ 2 = 1; Vm. 



(10.14) 



The condition T)et(F — L) = now determines the eigenvector with the zero eigenvalu e as a function of Lagrange 
multipliers L m . When we substitute this eigenvector into the second equation in Eq.(10.14) we obtain a set of 
equations for L m from which the state 1^) can be determined. The final POVM is then constructed by rotation of \"i>) 
by N + 1 angles <f> s in such a way that all off-diagonal elements of y^ g (O s ) m .„ become equal to zero. This is done in 
exactly the same way as in the example presented above, (see Section X A). The resulting POVM corresponds to the 
von Neumann measurement performed on the composite system of all N ions characterized by the set of orthogonal 
projectors 



P a = |*.)<*.|; 



I*,) = 



1 



VN- 



N 



e -jv+i 



sq 



(10.15) 



and the maximal mean fidelity / is given as the sum: / = 1/2 + 1/2 N+1 Y^i=o y ( i V )( f+i)- We plot this fidelity 
Fig. [l^ (see upper line). 

Finally, we note that the Hermitian operator $ constructed from the optimal POVM ( 10.15| ) 



s-0 



2vr 
N+1' 



■sP, 



(10.16) 



with the corresponding guesses as eigenvalues is identical to the Pegg-Barnett Hermitian phase operator j83|] originally 
introduced within completely different context. 

In conclusion, we have presented a general algorithm for the optimal state estimation from finite ensembles. It 
provides finite POVMs which, following the Neumark theorem |p4{ , can, at least in principle, be implemented as 
simple quantum computations. 



C. Neumark theorem and realization of generalized measurements 



Applicability of many ideas presented in previous parts of this Section critically relies on the assumption that the 
generalized quantum measurements are in principle realizable. This is not obvious, since the typical quantum measure- 
ments, e.g. measurements which may be performed with the help of the Stern-Gerlach apparatus, a photo-detection, 
or a measurement of an atomic population by means of photo- ionization, etc., are all orthogonal measurements.^] Can 
we under this conditions hope to perform a controlled generalized measurement? 



8 This is, of course, an idealization. In practice, we never posses a perfect measuring apparatus. In other words, our mea- 
surements are always subject of an external and uncontrollable noise. Therefore, in principle, we always perform a randomized 
measurements corresponding to non-orthogonal POVMs. 
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To answer this question we start with the following consideration: Assume a quantum object (S) in a state described 
by a density matrix p. Instead of directly measuring it we subject this object to an interaction with another quantum 
object - the ancilla A (see Fig. |l4|). The ancilla is initially prepared in a particular (fixed) quantum state \a). After 
some interaction time the composite system (S + A) evolves into a nontrivial entangled quantum state Up® \a)(a\U^ . 
The orthogonal measurement {P r } is then performed on the composite quantum system. In this case the conditional 
probability distribution p{r\p) defined as 



p{r\p) = Tr. 



S+A 



P r Up® |a)(a|CTt = Tr 5 Tr^ U^P r U 1 % \a)(a\ 



Tr, 



O r f> 



(10.17) 



can be specified. It can be seen, that each projector P r is associated with a new operator O r = Tr^[P r Ul®\a)(a\W}. 
In addition, it is easy to check that the operators O r together compose the nonorthogonal POVM. Therefore, the 
described procedure represents an orthogonal realization of a nonorthogonal POVM. 

Relation between the orthogonal and the nonorthogonal POVMs is actually even more close. As showed by Neu- 
mark Q], not only any particular case of the procedure we have described realizes an nonorthogonal POVM, but also 
the converse is true: 

Theorem (Neumark) Any POVM 0(x) dx defined in the Hilbert space Ti may arise as a restriction of an orthogonal 
POVM E(x) dx in a larger Hilbert space Ti. 



0(x) dx = PE(x)P dx, 



(10.18) 



where P is the projection from Ti onto Ti. 

In what follows we present a construction which proves the restricted version of this theorem. Namely, we will assume 
only a finite-component POVMs {O r } R =1 , where each component is of the form O r = c^|^ r r )(^ r |. For this purpose 
the following construction is suitable: 



Proof (special case) We are looking for a unitary transformation U, which satisfies the condition 



O r = Tr A 



U^P r U 1® \a){a\ 



Vr = 1,2, 



R. 



(10.19) 



It turns out, that for the construction only R of the all d x a (a and d is the number of dimensions of the ancilla 
and the measured system Hilbert spaces, respectively) dimensions of the space Tis+A are relevant. Therefore we will 
construct the unitary operation U only on a subspace Ti R C TLs+a- To have a suitable notation we also divide Hilbert 
space Ti R into two subspaces Ti R = Ti d © Ti R ~ = Ti d © Ti k , where the first coincides with the linear span of the 
vectors |\f r ) ® \a). By inspection, if the unitary operation U is of the form 



U = J2\Pr){'<Pr\ +J2\Pr){Xr\, 



(10.20) 



where \tp r ) = c r \^ r ) ® \a) £ H d and \\ r ) are orthogonal to all |^ s ) (i.e. \\ r ) £ H k ), then Eq. ( 10.1S ) is satisfied. 
Therefore we need to find a proper set of vectors \xr), so that U is indeed a unitary transformation, i.e. 



UU f 



R 

E 

r.s— 1 



{lpr\^s)\Pr)(Ps 



R 

£ 

r.s— 1 



(Xr\Xs)\Pr)(j>s 



A + B = 1. 



(10.21) 



The operators A and B are not diagonal in the basis \p r ). However, a new basis \p r ) can be found in which both 
operators A and B are diagonal. This basis is found by diagonalization of the matrix A rs = (ip r \ip s ) using the suitable 
unitary matrix V rs [i.e., \p r ) = X} s =i Vsr\Ps), A rs — J2i,j=i Ki^-y^js]- Moreover, since the following relations 



{-A 2 )rs — / A r iAi s — {A) r 



i=i 



Tr[i] =J2A SS =d, 



(10.22) 



are satisfied (both as the consequence of the completeness condition of the original POVM) we know that the spectra 

d R-d d R-d 

of the operators A and B are: SpA = {1, 1, . . . 1, 0, 0, . . . 0}; SpB = {0, 0, . . . 0, 1, 1, . . . l l. Th erefore, if we use the 



basis \p r ) instead of the basis \p r ) we can write the unitary transformation U in the Eq. (10.18|) in the form 
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r=l 



-d+1 



(10.23) 



Because the matrices A rs = (i/) r \ipa) an d B rs — (Xr\Xs) are diagonal only those \ip r ) = J2f=i(Ps\Pr)\ips) — 
^2f =1 V sr \ip s ) and \xs) — Sr=i ^ rs \Xr) are nonzero for which r = 1, 2, . . . d and s = d + 1, . . .R, respectively. More- 
over, this also justifies our assumption about the existence of the vectors \x r ) orthogonal to all \ips)- More exactly, as 
we know the vectors \4> r ), T = 1, 2, ... d we can complete them into an orthonormal basis in Ti R by choosing the set 
of vectors \Xs)> s = d + 1, . . . R. Because the vectors with bar are related to the vectors without bar via the unitary 
matrix V sr , the vectors \x r ) and \ips) remain mutually orthogonal. This concludes our proof. 

We note, that there is a freedom in the way how we choose to complete the set of vectors \^) r ), r = 1,2, . . .d by 
the vectors \xr), r = d +1,. . . R, so that they together form an orthonormal basis. The explicit form of the unitary 
operation U from Eq. (10.1 8|) is 



(10.24) 



-d+1 



Therefore we conclude: If we can in the system composed of the original object and the auxiliary system perform a 
particular von Neumann measurement characterized by a set of orthogonal pr ojecto rs \p r )(pr\ and, in addition, we 
can transform this composed system by the unitary transformation U [see Eq. (10.24)], then we can realize a general 
POVM measurement {O r } (for more details see [jg5[). 



XI. INSTEAD OF CONCLUSIONS 



We conclude this paper by a citation from the Jaynes' Brandeis lectures (see p. 183 of Ref. |86j): "Conventional 
quantum theory has provided an answer to the problem of setting up initial state descriptions only in the limiting 
case where measurements of a "complete set of commuting observables" have been made, the density matrix p(0) then 
reducing to the projection operator onto a pure state ^(0) which is the appropriate simultaneous eigenstate of all 
measured quantities. But there is almost no experimental situation in which we really have all this information, and 
before we have a theory able to treat actual experimental situations, existing quantum theory must be supplemented with 
some principle that tells us how to translate, or encode, the results of measurements into a definite state description 
p(0). Note that the problem is not to find p(0) which correctly describes "true physical situation" . That is unknown, 
and always remains so, because of incomplete information. In order to have a usable theory we must ask the much 
more modest question: What p(0) best describes our state of knowledge about the physical situation? " . 
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APPENDIX A 



Conceptually the reconstruction scheme based on the Jaynes principle of the maximum entropy is very simple. 
On the other hand particular analytical calculations can be difficult and in many cases cannot be performed. In 
this appendix we present explicit calculations of generalized canonical density operators (GCDO) and corresponding 



entropies for two observation levels O g and O^' defined in Table [VI. 



(2) 

A. 1. Observation level O a 



Let as assume the observation level Oq" 1 given by the set of observables {a ®*r ; ;4 
fry 1 * 1 <S) &y 2 ^}- In this case the GCDO reads 



,(1)^ ~(2) .(1)^.(2). .(!)_ ~(2).~(1) 



p G =—exp(-E), 



where 



Z n = Tr 



exp ( —E 



is the partition function. Here we have used the abbreviation 

E = X zz a^ ® a™ + \ xx a^ ® a x 2) + \ xy a^ ® ct( 2) + V^' ® <^ 2) + \y^y ] ® of*. 
The corresponding entropy has the form 

5g In ~t~ A^s^ "T" -^xxCaia: ~t~ ^-xyCxy ~t~ Xyx^yx ^yy^iyyi 



(A.l) 



(A.2) 



(A.3) 



(A.4) 



Using the algebraic properties of the operators associated with the given observation level we find the GCDO (A.l) 
to read 



1 



PG 



where we use the notation 



Ztw = (o-W ® oj 2) ^ , (p,v = x,y,z). 



(A.5) 



(A.6) 



Now we express the entropy as a function of expectation values of operators associated with the observation level 

(2) (2) (2) (2) 

O g . With the help of this entropy function we can perform reductions of O g to the observation levels O h , O f 

(2) 

and O e . In order to perform this reduction we express A M „ in Eq. (A.4) as functions of the expectation values 
To do so we utilize the relation 



d\nZ G 



(A.7) 



The partition function Zq can be found when we rewrite the operator E in Eq.(A.4) as a 4x4 matrix: 



E 



fa d* 

-a b* 

b -a 

\d a 



where we used the abbreviations 

d A 22 , b X-xx ^yy i(^xy ^yxji ^ ^xx ^yy ^{^xy ^yx)' 



(A. 



(A.9) 
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The powers of the operator E can be written as 



E n = 



with the matrix elements given by the relations 













\ 





^22 












-^32 


-^33 







Uii 








^44 


/ 



= BW = I[(a+|d|) n + (a-|d|n 



(n) 
14 
(n) 
22 
(n) 
23 
(n) 
32 



l[(a+\d\r-(a-\d\r} w 



^ = ^ [(-« + + (-« - IM)' 
i[(-a + |6|) n -(-o- 



(n)* 
23 
(n)* 
14 



Now we find 



cxp 











e a cosh|6| 
- e °sinh(|6|)A 





e a sinh(|&|)ji_ 
e a cosh |6| 




e a cosh|d| 



e- a sinh(|d|)A 

from which we obtain the expression for the partition function Zq 

Z G = 2e- a cosh |d| + 2e a cosh |6| . 
For the expectation values given by Eq.(A.7) we obtain 

£ zz = -±- [2e' a cosh \d\ - 2e a cosh |6|] ; 



- e - a sinh(|d|)^ 



e~ a cosh \d\ 



l 

Z G 
1 

Z G 



C - 3_ 

^y* ~ z a 
f = L_ 



2e- a sinh(|d|) (X xx - X yy ) + 2e a sinh(|6|)^ (X xx + X yy ) 
2e- a smh(\d\) (X xy + \ yx ) + 2e a sinh(|6|)^ {X xy - X yx ) 
2e- a sinhfldQ^ (X xy + X yx ) - 2e a sinh(|6|)^ (X xy - X yx ) 
-2e- a sinh(|d|)^ T {X xx - X yy ) + 2e a sinh(|6|)^ (X xx + X yy ) 



If we introduce the abbreviations 

B -\- ^ yy i {^ xy 

then with the help of Eq. (A. 14) we obtain 



S,yx ) 7 D — £ xx £yy + i (£ X y + £,yx ) > 



Taking into account that 



wc find 



\B\ = e « s inh(|6|) : 
Zg 



\D\ = — e - a sinh(|d|), 
Zg 



B b D 

lbT""W' W\ 



d_ 

\dl 



Now we introduce four new parameters Mj 

Mi = l+^„ + |D|, M 2 = l+£ z 



\D\ 



78 



M 3 = 1-£ ZZ + \B\, M 4 = 1-£ ZZ -\B\, (A.19) 

in terms of which we can express the von Neumann entropy on the given observation level. Using Eqs. (A. 13), (A. 14) 
and (A. 17) we obtain 



Ah = £ exp (-a + \d\) , M 2 = £ exp (-a - \d\) , 
Ah = 4^ exp (a + \b\) , M 4 = ^ exp (a - |6|) . 

The Lagrange multipliers Xki can be expressed as functions of the expectation values 



(A.20) 



, , /M 3 M 4 \ 4 „ IN fM 3 \ 2 „ lx {MA 2 



After inserting these expressions into Eq. (A. 13) we obtain for the partition function 

4 



(MiM 2 M 3 M 4 

When we insert Eqs. (A. 18), (A. 21) and (A. 22) into Eqs. (A.l), (A. 4) and (A. 12) then we find both the entropy 

4 

'4 V 4 



(A.22) 



i=i 

and the GCDO 





(A.24) 

as functions of the expectation value s Finally, we can rewrite the reconstructed density operator (A.24) in terms 
of the spin operators (see Table VII). 



A. 2. Observation level O h 



(2) 



The GCDO on the 0%> can be obtained as a result of a reduction of the observation level O g . The difference 
between these two observation levels is that the Ojp does not contain the operator , the corresponding 



ds G 1 { MiW 



(2) ■ . . (2) 

^jj^ Coin be obtctincd &>s ° 1 ' 10 ' 1 ^ 1 * ' 11 v/ w ^ 1 1 ' 1 1 1 41 1 ' 1 ' ^ ' 1 ' ! ' ' 11 ' ~' 1 1 ' 11 1 ' ' ^ 1 ^ ''^ 

•vation levels is that the 

mean value is unknown from the measurement 

(2) (2) 
According to the maximum-entropy principle, the observation level O h can be obtained from O g by setting the 

Lagrange multiplier X zz equal to zero. With the help of the relation [see Eq.(A.7)] 

dS G = 1 / M X M 2 
4 n V M 3 M 4 

we obtain 

MiM 2 = M3M4. (A.26) 
From this equation we find the "predicted" mean value of the operator <7z ® cri 2 '' (i.e., the parameter i in Table III) 

£« = \(\D\ 2 -\B\ 2 ) =t (A.27) 
Taking into account that the parameters \B\ and \D\ read 

\B\ 2 = {(, XX + £,yy) 2 + (£ X y - £y X ) , | D | 2 = ((, xx - £yy) 2 + (£ X y + £,y X ) 2 , (A.28) 
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we can express the predicted mean value £ zz as a function of the measured mean values £ xx , £ xy , £ vx and ^ yy : 

^zz — {£,xy^yx £>xx£,yy) • (A. 29) 

When we insert Eq. (A. 27) into Eq. (A. 19) we obtain: 

Mi = AiA 2 , M. = N 3 Ni , M - = A1A3 , Af 4 = A 2 A 4 , (A.30) 
where the parameters Ni arc defined as 

^1 = 1 + 5 (\D\ + \B\) , N 2 = 1 + \ (\D\ - \B\) , 

N 3 = 1-±(\D\-\B\), N< = l-±(\D\ + \B\). (A.31) 
In addition, from Eqs. (A.30) and (A. 23) we obtain the expression for the von Neumann entropy of the density 

(2) 

operator reconstructed on the observation level O h : 

(A.32) 

Finally, from Eqs. (A. 28) and (A. 24) we find the expression for the GCDO on the observation level d§ (see Table [Vl|): 

f>H = \[I {1) ®i {2) +{ixyS,yx~^yy)^ ] ®af ] . 

+ ixxVx 1 ^ <8> O x 2) + Zxy&x^ ® dy 2) + £ yx By^ ® Ox^ + tyy^ <8> & y^ ] ■ 



APPENDIX B: INVARIANT INTEGRATION MEASURE 



In differential geometry the integration measure is a global object - the so called invariant volume form u>. The 
condition that cIq is invariant under the action of each group element U £ SU (n) is equivalent to the requirement 



d„ = d , 



LviUl = 



I = 1, 



,n 2 -l, 



(B.l) 



that the Lie derivative of u> with respect to the fundamental field Vi of action of the group SU (n) in the space fl is 
zero. The vector fields 



V = V?{xi, . . . , z (2n _ 2) )^-; 6=1,2... (2n - 2) 



(B.2) 



are defined via the actions of one-parametric subgroups expfitS'i) C SU(n), t € R (one action for each generator Si 
On the other hand the elements of the space f2 [see Eq.( 9.20 )l have a structure 



P{%1, ■ ■ .,a!{2n-2)) = - + P(%1, ■ ■ ■ ,X(2n-2))Si, 



(B.3) 



where Si are n 2 — 1 linearly independent, zero-trace, Hermitian, n x n matrixes, i.e. they are generators of the SU(n) 
group. Due to this we can express the vector fields Vi 



d 



d 



1 dx^ dt 



exp(itSi)pexp(—itSi) 



as the solutions of the following equation: 



B 

V b —f k = i c k f J 
1 dx b J l]J 



(B.4) 



(B.5) 
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The complex numbers c£ are the coefficients in commutation relations [Si,Sj] = c^Sk- We note, that Eq. ( |B.5| ) 
represents for each fixed index i an overdetermined system of n 2 — 1 linear equations for 2n — 2 unknown func- 
tions (the fact that this system is consistent confirms the corr ectness of our parameterization of the state 
space Q). Finally, we present an explicit coordinate form of Eq.(B.l), which determines the invariant volume form 
lu = m(xi, . . . , X( 2n _2)) A dx\ A ... A dxnn-2) as the solution of a system of partial differential equations: 



(B.6) 



Here we note, that mVi in Eq.(B.6) has the meaning of a "flow" of the density of st ates generated by unitary 
transformations associated with the z-th generator. From the physical point of view Eq.(B.6) means that the divergence 
of this flow is zero, i.e. the number of states in each (confined) volume element is constant. 

As an illustration of the abo ve d iscussion we firstly evaluate the invariant measure for the state space of a single 
spin- 1/2. Using the definition (BJ5) we find the fundamental field of action Vi (« = 1,2,3) for the three generators of 
the SU{2) group: 



V\ = cos(</>) cot(0)<90 + sm(4>)de; V 2 = sin(</>) cot(#)<90 - cos(^>)<9g; V 3 = -d<p. (B.7) 
We substitute these generators into Eq.([B.6|) and after some algebra we obtain the system of differential equations: 



d n 
— -m = U 



— m = mcot(0), 



(B- 



which can be easily solved, 



m(0,(j>) = const sin(0). 



(B.9) 



The multiplicative factor is given by the normalization condition. This is the route to derive the integration measure 
of the Poincare sphere. Analogously we evaluate the invariant integration meas ure fo r a state space of two spins-1/2. 
The calculations are technically more involved, but the result is simple see Eq. (9.22). 



APPENDIX C: BAYESIAN INFERENCE ON IN THE LIMIT OF INFINITE NUMBER OF 

MEASUREMENTS 

On the given observation level we can express the estimated density operator in the limit of infinite number of 
measurements as 



1 



2tt r l 



l> = j- J d^jj x 2 dxj\yj\zS^ai 1) )~xy)S((^ai 2) )-yz+[(l-x 2 )(l-y^ 



x < 1 ® 1 + xya z ® 1 + xzl ®a z + 



yz- ((l-a; 2 )(l-y 2 )(l-z 2 )) 1/2 cos^] a z ®a^ 



(C.l) 



When we integrate Eq.(C.l) over the variable y and we obtain 



(l-x 2 )(l-z 2 ) 1 



1/2 



COS ifj 



1 + xzl ® a z + 



(1-^(1-^)1- 



(^) Jul/J 



cos^) 



(C.2) 



where the integration boundaries are defined as 



81 



£":={-l,l} and \x\>\(a^)\. 
Now we will integrate Eq.( |C.2| ) over the variable ip. There are two values ip^ U — 1j 2) of ip, such that 



COS "00 



providing that inequality 



1 > 



[(l-^Xi-^Xi-^n/*) 2 )] 



(^ 1, ( Tf , )-<<Ti 1, )2/x 



1/2 ' 



(C.3) 



(C.4) 



[(l-* 2 )(l-z 2 )(l-(<^x) 2 )] 



1/2 



(C.5) 



holds. The last relation can be rewritten as the condition a + bz + cz 2 > 0, where the explicit forms of the coefficients 
a, b, and c are: 

a = 1 - (a^f/x 2 + (a^) 2 - (a' 1 '^ 2 '} 2 - x 2 ; 



b = 2(a z 1) )(a z 1) a z 2 ')/ 



c = x 2 -(a^) 2 -l. 



(C.6) 



The coefficient c is always negative, which means that we have a new condition for the parameter z, that is z S (zi, 2:2), 
where z\ and Z2 are two roots of the quadratic equation a + bz + cz 2 — 0. H owever, these roots exist only providing 
the discriminant b 2 — Aac > is nonnegative. Taking into account Eq.(C.6) we see that the last relation is a cubic 
equation with respect to the variable x 2 , which imposes a new condition on the integration parameter x. That is, the 
interval C" through which the integration over x in Eq.(p.2|) is performed is defined as 



C" 



'1(^)1,1} 



for Ka^'a' 2 ')! < \(a™)\; 



z vz 

'|( ( t< 1) )|,V1 + (^ 1) ) 2 -(^ 1, ^ 2) ) 2 } for K^aDl^Kai 1 '}! 



Taking into account all conditions imposed on parameters of integration we can rewrite Eq.( C2 ) as 



Af 



dz 



\x\ J Zl Va + bz + cz* 



rdx 



(1 (g) 1 + }a z ®l + xzl ®a z + (ai 1} trf > )a z ® a z ). 



(C.7) 



(C.8) 



Using standard formulas [see, for example, [fT4[ , Eq.(2.26f) and Eq. (2.264)] the integration over parameter z in Eq.( |C.8| ) 
can now be performed and we obtain 



1 



1 



'' Mj c „ dx \x\ (1 + -x*) 



r (i <g> i + (a i z 1) }a z ® 1 + <g> a z ) 



dx 



M (i + ^i 1 ') 2 -x 2 y 



-(1® a z ). 



(C.9) 



After performing integration over x in Eq. (|C.9| ) we obtain final the expression ( ^3lj) for the a posteriori estimation 
of the density operator on the given observation level. 
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